Class UnknownDictionary
- java.lang.Object
  - org.apache.lucene.analysis.ja.dict.BinaryDictionary
    - org.apache.lucene.analysis.ja.dict.UnknownDictionary
All Implemented Interfaces: Dictionary
public final class UnknownDictionary extends BinaryDictionary
Dictionary for unknown-word handling.
Nested Class Summary
Nested classes/interfaces inherited from class org.apache.lucene.analysis.ja.dict.BinaryDictionary
BinaryDictionary.ResourceScheme
Field Summary
Fields inherited from class org.apache.lucene.analysis.ja.dict.BinaryDictionary
DICT_FILENAME_SUFFIX, DICT_HEADER, HAS_BASEFORM, HAS_PRONUNCIATION, HAS_READING, POSDICT_FILENAME_SUFFIX, POSDICT_HEADER, TARGETMAP_FILENAME_SUFFIX, TARGETMAP_HEADER, VERSION
Fields inherited from interface org.apache.lucene.analysis.ja.dict.Dictionary
INTERNAL_SEPARATOR
Constructor Summary
Constructors
UnknownDictionary(URL targetMapUrl, URL posDictUrl, URL dictUrl)
  Create an UnknownDictionary from an external resource URL (e.g. from Classpath with ClassLoader.getResource(String)).
UnknownDictionary(Path targetMapFile, Path posDictFile, Path dictFile)
  Create an UnknownDictionary from an external resource path.
UnknownDictionary(BinaryDictionary.ResourceScheme scheme, String path)
  Deprecated, for removal: This API element is subject to removal in a future version. Replaced by UnknownDictionary(Path, Path, Path) for files and UnknownDictionary(URL, URL, URL) for classpath/module resources.
Method Summary
Modifier and Type  Method  Description
CharacterDefinition  getCharacterDefinition()
String  getInflectionForm(int wordId)
  Get inflection form of tokens
String  getInflectionType(int wordId)
  Get inflection type of tokens
static UnknownDictionary  getInstance()
String  getReading(int wordId, char[] surface, int off, int len)
  Get reading of tokens
int  lookup(char[] text, int offset, int len)
Methods inherited from class org.apache.lucene.analysis.ja.dict.BinaryDictionary
getBaseForm, getLeftId, getPartOfSpeech, getPronunciation, getResource, getRightId, getWordCost, lookupWordIds
Constructor Detail
UnknownDictionary
@Deprecated(forRemoval=true, since="9.1") public UnknownDictionary(BinaryDictionary.ResourceScheme scheme, String path) throws IOException
Deprecated, for removal: This API element is subject to removal in a future version. Replaced by UnknownDictionary(Path, Path, Path) for files and UnknownDictionary(URL, URL, URL) for classpath/module resources.
Parameters:
scheme - scheme for loading resources (FILE or CLASSPATH).
path - where to load resources from; a path, including the file base name without extension; this is used to match multiple files with the same base name.
Throws:
IOException
UnknownDictionary
public UnknownDictionary(Path targetMapFile, Path posDictFile, Path dictFile) throws IOException
Create an UnknownDictionary from an external resource path.
Parameters:
targetMapFile - where to load target map resource
posDictFile - where to load POS dictionary resource
dictFile - where to load dictionary entries resource
Throws:
IOException - if resource was not found or broken
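
When moving away from the deprecated (scheme, path) constructor, the single base path (file base name without extension) has to be expanded into three separate file paths. The following is a minimal sketch using only java.nio.file. DictionaryPaths and withSuffix are hypothetical names, not part of Lucene, and the suffix is supplied by the caller; the inherited TARGETMAP_FILENAME_SUFFIX, POSDICT_FILENAME_SUFFIX and DICT_FILENAME_SUFFIX constants look like the natural candidates, but this page neither states their values nor confirms that mapping.

```java
import java.nio.file.Path;

/** Hypothetical helper, not part of Lucene. */
final class DictionaryPaths {
  private DictionaryPaths() {}

  /**
   * Appends a suffix to the last element of a base path, keeping the
   * parent directory unchanged.
   */
  static Path withSuffix(Path base, String suffix) {
    Path fileName = base.getFileName();
    if (fileName == null) {
      throw new IllegalArgumentException("base path has no file name: " + base);
    }
    // resolveSibling swaps the last path element and keeps the parent.
    return base.resolveSibling(fileName.toString() + suffix);
  }
}
```

Calling withSuffix once per suffix yields the three Path values that could be passed as targetMapFile, posDictFile and dictFile; the constructor itself still throws IOException if a resource is missing or broken.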
UnknownDictionary
public UnknownDictionary(URL targetMapUrl, URL posDictUrl, URL dictUrl) throws IOException
Create an UnknownDictionary from an external resource URL (e.g. from Classpath with ClassLoader.getResource(String)).
Parameters:
targetMapUrl - where to load target map resource
posDictUrl - where to load POS dictionary resource
dictUrl - where to load dictionary entries resource
Throws:
IOException - if resource was not found or broken
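
ClassLoader.getResource(String) returns null instead of throwing when a resource cannot be found, so it can be useful to fail early before handing the URLs to this constructor. The following is a minimal sketch using only the JDK. ResourceUrls and requireResource are hypothetical names, not part of Lucene, and the resource names a caller would pass in are not specified on this page.

```java
import java.io.FileNotFoundException;
import java.net.URL;

/** Hypothetical helper, not part of Lucene. */
final class ResourceUrls {
  private ResourceUrls() {}

  /**
   * Looks up a classpath resource and throws FileNotFoundException
   * (a subtype of IOException) if the class loader cannot find it.
   */
  static URL requireResource(ClassLoader loader, String name) throws FileNotFoundException {
    URL url = loader.getResource(name);
    if (url == null) {
      throw new FileNotFoundException("classpath resource not found: " + name);
    }
    return url;
  }
}
```

The three URLs resolved this way could then be passed as targetMapUrl, posDictUrl and dictUrl.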
Method Detail
lookup
public int lookup(char[] text, int offset, int len)
getCharacterDefinition
public CharacterDefinition getCharacterDefinition()
getReading
public String getReading(int wordId, char[] surface, int off, int len)
Description copied from interface: Dictionary
Get reading of tokens
Specified by:
getReading in interface Dictionary
Overrides:
getReading in class BinaryDictionary
Parameters:
wordId - word ID of token
Returns:
Reading of the token
getInflectionType
public String getInflectionType(int wordId)
Description copied from interface: Dictionary
Get inflection type of tokens
Specified by:
getInflectionType in interface Dictionary
Overrides:
getInflectionType in class BinaryDictionary
Parameters:
wordId - word ID of token
Returns:
inflection type, or null
getInflectionForm
public String getInflectionForm(int wordId)
Description copied from interface: Dictionary
Get inflection form of tokens
Specified by:
getInflectionForm in interface Dictionary
Overrides:
getInflectionForm in class BinaryDictionary
Parameters:
wordId - word ID of token
Returns:
inflection form, or null
getInstance
public static UnknownDictionary getInstance()