- All Implemented Interfaces:
public class UkrainianWordTokenizer
Tokenizes a sentence into words.
Punctuation and whitespace gets its own token.
Specific to Ukrainian: apostrophes (0x27 and U+2019) not in the list as they are part of the word
- Andriy Rysin
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait