Class MorfologikFilter
- java.lang.Object
-
- org.apache.lucene.util.AttributeSource
-
- org.apache.lucene.analysis.TokenStream
-
- org.apache.lucene.analysis.TokenFilter
-
- org.apache.lucene.analysis.morfologik.MorfologikFilter
-
- All Implemented Interfaces:
Closeable,AutoCloseable,Unwrappable<TokenStream>
public class MorfologikFilter extends TokenFilter
TokenFilterusing Morfologik library to transform input tokens into lemma and morphosyntactic (POS) tokens. Applies to Polish only.MorfologikFilter contains a
MorphosyntacticTagsAttribute, which provides morphosyntactic annotations for produced lemmas. See the Morfologik documentation for details.- See Also:
- Morfologik project page
-
-
Nested Class Summary
-
Nested classes/interfaces inherited from class org.apache.lucene.util.AttributeSource
AttributeSource.State
-
-
Field Summary
-
Fields inherited from class org.apache.lucene.analysis.TokenStream
DEFAULT_TOKEN_ATTRIBUTE_FACTORY
-
-
Constructor Summary
Constructors Constructor Description MorfologikFilter(TokenStream in)Creates a filter with the default (Polish) dictionary.MorfologikFilter(TokenStream in, morfologik.stemming.Dictionary dict)Creates a filter with a given dictionary.
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description booleanincrementToken()Retrieves the next token (possibly from the list of lemmas).voidreset()Resets stems accumulator and hands over to superclass.-
Methods inherited from class org.apache.lucene.analysis.TokenFilter
close, end, unwrap
-
Methods inherited from class org.apache.lucene.util.AttributeSource
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, endAttributes, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, removeAllAttributes, restoreState, toString
-
-
-
-
Constructor Detail
-
MorfologikFilter
public MorfologikFilter(TokenStream in)
Creates a filter with the default (Polish) dictionary.
-
MorfologikFilter
public MorfologikFilter(TokenStream in, morfologik.stemming.Dictionary dict)
Creates a filter with a given dictionary.- Parameters:
in- input token stream.dict- Dictionary to use for stemming.
-
-
Method Detail
-
incrementToken
public final boolean incrementToken() throws IOExceptionRetrieves the next token (possibly from the list of lemmas).- Specified by:
incrementTokenin classTokenStream- Throws:
IOException
-
reset
public void reset() throws IOExceptionResets stems accumulator and hands over to superclass.- Overrides:
resetin classTokenFilter- Throws:
IOException
-
-