public class SentenceTokenizer extends Object implements Tokenizer
| Constructor and Description |
|---|
SentenceTokenizer()
Create a sentence tokenizer that uses the built-in abbreviations.
|
SentenceTokenizer(String[] abbrevList)
Create a sentence tokenizer with the given list of abbreviations,
additionally to the built-in ones.
|
| Modifier and Type | Method and Description |
|---|---|
void |
setSingleLineBreaksMarksParagraph(boolean lineBreakParagraphs) |
boolean |
singleLineBreaksMarksPara() |
List<String> |
tokenize(String s)
Tokenize the given string to sentences.
|
public SentenceTokenizer()
public SentenceTokenizer(String[] abbrevList)
public void setSingleLineBreaksMarksParagraph(boolean lineBreakParagraphs)
lineBreakParagraphs - if true, single lines breaks are assumed to end a paragraph,
with false, only two ore more consecutive line breaks end a paragraphpublic boolean singleLineBreaksMarksPara()
Copyright © 2005-2011 Daniel Naber