de.danielnaber.languagetool.tagging
Interface Tagger

All Known Implementing Classes:
BaseTagger, CzechTagger, DemoTagger, DutchTagger, EnglishTagger, FrenchTagger, GermanTagger, ItalianTagger, PolishTagger, RussianTagger, SpanishTagger, SwedishTagger, UkrainianMorfoTagger, UkrainianMyspellTagger, UkrainianTagger

public interface Tagger

The part-of-speech tagger interface, whose implementions are usually language-dependent.

Author:
Daniel Naber

Method Summary
 Object createNullToken(String token, int startPos)
          Create the AnalyzedToken used for whitespace and other non-words.
 List<AnalyzedTokenReadings> tag(List<String> sentenceTokens)
          Returns a list of AnalyzedTokens that assigns each term in the sentence some kind of part-of-speech information (not necessarily just one tag).
 

Method Detail

tag

List<AnalyzedTokenReadings> tag(List<String> sentenceTokens)
                                throws IOException
Returns a list of AnalyzedTokens that assigns each term in the sentence some kind of part-of-speech information (not necessarily just one tag).

Note that this method takes exactly one sentence. Its implementation may implement special cases for the first word of a sentence, which is usually written with an uppercase letter.

Parameters:
sentenceTokens - the text as returned by a WordTokenizer but whithout whitespace tokens.
Throws:
IOException

createNullToken

Object createNullToken(String token,
                       int startPos)
Create the AnalyzedToken used for whitespace and other non-words. Use null as the POS tag for this token.



Copyright © 2005-2007 Daniel Naber