Package org.apache.lucene.analysis.fi
Class FinnishAnalyzer
java.lang.Object
org.apache.lucene.analysis.Analyzer
org.apache.lucene.analysis.StopwordAnalyzerBase
org.apache.lucene.analysis.fi.FinnishAnalyzer
- All Implemented Interfaces:
- Closeable,- AutoCloseable
Analyzer for Finnish.- Since:
- 3.1
- 
Nested Class SummaryNested classes/interfaces inherited from class org.apache.lucene.analysis.AnalyzerAnalyzer.ReuseStrategy, Analyzer.TokenStreamComponents
- 
Field SummaryFieldsFields inherited from class org.apache.lucene.analysis.StopwordAnalyzerBasestopwordsFields inherited from class org.apache.lucene.analysis.AnalyzerGLOBAL_REUSE_STRATEGY, PER_FIELD_REUSE_STRATEGY
- 
Constructor SummaryConstructorsConstructorDescriptionBuilds an analyzer with the default stop words:DEFAULT_STOPWORD_FILE.FinnishAnalyzer(CharArraySet stopwords) Builds an analyzer with the given stop words.FinnishAnalyzer(CharArraySet stopwords, CharArraySet stemExclusionSet) Builds an analyzer with the given stop words.
- 
Method SummaryModifier and TypeMethodDescriptionprotected Analyzer.TokenStreamComponentscreateComponents(String fieldName) Creates aAnalyzer.TokenStreamComponentswhich tokenizes all the text in the providedReader.static CharArraySetReturns an unmodifiable instance of the default stop words set.protected TokenStreamnormalize(String fieldName, TokenStream in) Methods inherited from class org.apache.lucene.analysis.StopwordAnalyzerBasegetStopwordSet, loadStopwordSet, loadStopwordSetMethods inherited from class org.apache.lucene.analysis.AnalyzerattributeFactory, close, getOffsetGap, getPositionIncrementGap, getReuseStrategy, initReader, initReaderForNormalization, normalize, tokenStream, tokenStream
- 
Field Details- 
DEFAULT_STOPWORD_FILEFile containing default Italian stopwords.- See Also:
 
 
- 
- 
Constructor Details- 
FinnishAnalyzerpublic FinnishAnalyzer()Builds an analyzer with the default stop words:DEFAULT_STOPWORD_FILE.
- 
FinnishAnalyzerBuilds an analyzer with the given stop words.- Parameters:
- stopwords- a stopword set
 
- 
FinnishAnalyzerBuilds an analyzer with the given stop words. If a non-empty stem exclusion set is provided this analyzer will add aSetKeywordMarkerFilterbefore stemming.- Parameters:
- stopwords- a stopword set
- stemExclusionSet- a set of terms not to be stemmed
 
 
- 
- 
Method Details- 
getDefaultStopSetReturns an unmodifiable instance of the default stop words set.- Returns:
- default stop words set.
 
- 
createComponentsCreates aAnalyzer.TokenStreamComponentswhich tokenizes all the text in the providedReader.- Specified by:
- createComponentsin class- Analyzer
- Returns:
- A Analyzer.TokenStreamComponentsbuilt from anStandardTokenizerfiltered withLowerCaseFilter,StopFilter,SetKeywordMarkerFilterif a stem exclusion set is provided andSnowballFilter.
 
- 
normalize
 
-