de.danielnaber.languagetool.tagging
Class BaseTagger

Object
  extended by BaseTagger
All Implemented Interfaces:
Tagger
Direct Known Subclasses:
CzechTagger, DutchTagger, EnglishTagger, FrenchTagger, ItalianTagger, PolishTagger, RussianTagger, SpanishTagger, SwedishTagger, UkrainianMorfoTagger

public abstract class BaseTagger
extends Object
implements Tagger

Base tagger using Lametyzator.

Author:
Marcin Milkowski

Constructor Summary
BaseTagger()
           
 
Method Summary
 Object createNullToken(String token, int startPos)
          Create the AnalyzedToken used for whitespace and other non-words.
abstract  void setFileName()
          Set the filename in a JAR, eg.
 List<AnalyzedTokenReadings> tag(List<String> sentenceTokens)
          Returns a list of AnalyzedTokens that assigns each term in the sentence some kind of part-of-speech information (not necessarily just one tag).
 
Methods inherited from class Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

BaseTagger

public BaseTagger()
Method Detail

setFileName

public abstract void setFileName()
Set the filename in a JAR, eg. /resource/fr/french.dict.


tag

public List<AnalyzedTokenReadings> tag(List<String> sentenceTokens)
                                throws IOException
Description copied from interface: Tagger
Returns a list of AnalyzedTokens that assigns each term in the sentence some kind of part-of-speech information (not necessarily just one tag).

Note that this method takes exactly one sentence. Its implementation may implement special cases for the first word of a sentence, which is usually written with an uppercase letter.

Specified by:
tag in interface Tagger
Parameters:
sentenceTokens - the text as returned by a WordTokenizer but whithout whitespace tokens.
Throws:
IOException

createNullToken

public final Object createNullToken(String token,
                                    int startPos)
Description copied from interface: Tagger
Create the AnalyzedToken used for whitespace and other non-words. Use null as the POS tag for this token.

Specified by:
createNullToken in interface Tagger


Copyright © 2005-2007 Daniel Naber