de.danielnaber.languagetool.tokenizers.en
Class EnglishWordTokenizer

Object
  extended by EnglishWordTokenizer
All Implemented Interfaces:
Tokenizer

public class EnglishWordTokenizer
extends Object
implements Tokenizer

Tokenizes a sentence into words. Punctuation and whitespace gets its own token.

Author:
Daniel Naber

Constructor Summary
EnglishWordTokenizer()
           
 
Method Summary
 List<String> tokenize(String text)
           
 
Methods inherited from class Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

EnglishWordTokenizer

public EnglishWordTokenizer()
Method Detail

tokenize

public List<String> tokenize(String text)
Specified by:
tokenize in interface Tokenizer


Copyright © 2005-2007 Daniel Naber