Class JapaneseIterationMarkCharFilter
- All Implemented Interfaces:
- Closeable, AutoCloseable, Readable
Normalizes Japanese horizontal iteration marks (odoriji) to their expanded form. Sequences of iteration marks are supported. If an illegal sequence of iteration marks is encountered, the implementation emits the illegal source character as-is without considering its script. For example, the input "?ゝ" yields "??" even though the question mark is not hiragana.
Note that the full stop punctuation character "。" (U+3002) cannot be iterated (see below). Iteration marks themselves are emitted unchanged if they are illegal, i.e. if they would reach back past the beginning of the character stream.
The implementation buffers input until a full stop punctuation character (U+3002) or EOF is reached, so that it does not need to keep a copy of the whole character stream in memory. Vertical iteration marks, which are even rarer than horizontal iteration marks in contemporary Japanese, are unsupported.
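The rules above can be illustrated with a small standalone sketch. It is not Lucene's code: the class `IterationMarkSketch` and its `expand` method are hypothetical, it works on a whole `String` instead of a buffered `Reader`, and it handles only 々 (U+3005) and ゝ (U+309D), ignoring voiced marks and katakana. It assumes that a run of n marks copies the n characters preceding the run, that marks with no reachable source character are emitted unchanged, and that "。" blocks copying across it.

```java
final class IterationMarkSketch {
    private static final char KANJI_MARK = '\u3005';    // 々
    private static final char HIRAGANA_MARK = '\u309D'; // ゝ
    private static final char FULL_STOP = '\u3002';     // 。

    private static boolean isMark(char c) {
        return c == KANJI_MARK || c == HIRAGANA_MARK;
    }

    /** Expands runs of iteration marks (simplified, assumption-based sketch). */
    static String expand(String s) {
        StringBuilder out = new StringBuilder(s.length());
        int floor = 0; // marks may not copy from positions before this index
        int i = 0;
        while (i < s.length()) {
            char c = s.charAt(i);
            if (!isMark(c)) {
                if (c == FULL_STOP) {
                    floor = i + 1; // nothing at or before a full stop can be iterated
                }
                out.append(c);
                i++;
                continue;
            }
            // Measure the run of consecutive iteration marks starting at i.
            int run = 0;
            while (i + run < s.length() && isMark(s.charAt(i + run))) {
                run++;
            }
            // Only as many marks as there are reachable source characters are expanded.
            int span = Math.min(run, i - floor);
            for (int k = 0; k < run; k++) {
                int pos = i + k;
                // Expanded marks copy the character 'span' positions back;
                // the remaining (illegal) marks are emitted unchanged.
                out.append(k < span ? s.charAt(pos - span) : s.charAt(pos));
            }
            i += run;
            floor = i; // a later run may not reach back into this one
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // Input is "馬鹿々々しい", written with escapes to avoid source-encoding issues.
        System.out.println(expand("\u99AC\u9E7F\u3005\u3005\u3057\u3044"));
        // Input is "?ゝ": the source character is copied regardless of its script.
        System.out.println(expand("?\u309D"));
    }
}
```

In this sketch, "馬鹿々々しい" expands to "馬鹿馬鹿しい", "?ゝ" becomes "??", and a mark at the very start of the input or directly after "。" is left as it is.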
Field Summary
- static final boolean NORMALIZE_KANA_DEFAULT: Normalize kana iteration marks by default
- static final boolean NORMALIZE_KANJI_DEFAULT: Normalize kanji iteration marks by default
Fields inherited from class org.apache.lucene.analysis.CharFilter
- input
Constructor Summary
- JapaneseIterationMarkCharFilter(Reader input): Constructor. Normalizes both kanji and kana iteration marks by default.
- JapaneseIterationMarkCharFilter(Reader input, boolean normalizeKanji, boolean normalizeKana): Constructor
Method Summary
Methods inherited from class org.apache.lucene.analysis.CharFilter
- close, correctOffset
Methods inherited from class java.io.Reader
- mark, markSupported, nullReader, read, read, ready, reset, skip, transferTo
Field Details
NORMALIZE_KANJI_DEFAULT
public static final boolean NORMALIZE_KANJI_DEFAULT
Normalize kanji iteration marks by default
NORMALIZE_KANA_DEFAULT
public static final boolean NORMALIZE_KANA_DEFAULT
Normalize kana iteration marks by default
Constructor Details
JapaneseIterationMarkCharFilter
Constructor. Normalizes both kanji and kana iteration marks by default.
- Parameters:
- input - char stream
JapaneseIterationMarkCharFilter
Constructor
- Parameters:
- input - char stream
- normalizeKanji - indicates whether kanji iteration marks should be normalized
- normalizeKana - indicates whether kana iteration marks should be normalized
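One possible way to call the three-argument constructor is sketched below. The helper class `IterationMarkFilterUsage` and its `filter` method are hypothetical names for this illustration. The example assumes the Lucene kuromoji analysis module is on the classpath and that the class is in the `org.apache.lucene.analysis.ja` package.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;
import org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilter;

final class IterationMarkFilterUsage {
    /** Reads all of 'text' through the char filter and returns the filtered result. */
    static String filter(String text, boolean normalizeKanji, boolean normalizeKana) {
        // The filter is a Reader, so it can be drained like any other character stream.
        try (Reader reader = new JapaneseIterationMarkCharFilter(
                new StringReader(text), normalizeKanji, normalizeKana)) {
            StringBuilder out = new StringBuilder();
            char[] chunk = new char[256];
            int n;
            while ((n = reader.read(chunk, 0, chunk.length)) != -1) {
                out.append(chunk, 0, n);
            }
            return out.toString();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        // "\u6642\u3005" is "時々"; kanji normalization is on in the first call, off in the second.
        System.out.println(filter("\u6642\u3005", true, true));
        System.out.println(filter("\u6642\u3005", false, true));
    }
}
```

Passing `false` for one of the flags should leave the corresponding kind of iteration mark untouched, which can be useful when only kana or only kanji marks need to be expanded.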
 
 
Method Details
read
- Specified by: read in class Reader
- Throws: IOException
read
- Overrides: read in class Reader
- Throws: IOException
correct
protected int correct(int currentOff)
- Specified by: correct in class CharFilter