Class UnknownDictionary
- java.lang.Object
-
- org.apache.lucene.analysis.ko.dict.BinaryDictionary
-
- org.apache.lucene.analysis.ko.dict.UnknownDictionary
-
- All Implemented Interfaces:
Dictionary
public final class UnknownDictionary extends BinaryDictionary
Dictionary for unknown-word handling.
-
-
Nested Class Summary
-
Nested classes/interfaces inherited from class org.apache.lucene.analysis.ko.dict.BinaryDictionary
BinaryDictionary.ResourceScheme
-
Nested classes/interfaces inherited from interface org.apache.lucene.analysis.ko.dict.Dictionary
Dictionary.Morpheme
-
-
Field Summary
-
Fields inherited from class org.apache.lucene.analysis.ko.dict.BinaryDictionary
DICT_FILENAME_SUFFIX, DICT_HEADER, HAS_READING, HAS_SINGLE_POS, POSDICT_FILENAME_SUFFIX, POSDICT_HEADER, TARGETMAP_FILENAME_SUFFIX, TARGETMAP_HEADER, VERSION
-
-
Constructor Summary
Constructors Constructor Description UnknownDictionary(URL targetMapUrl, URL posDictUrl, URL dictUrl)Create aUnknownDictionaryfrom an external resource URL (e.g.UnknownDictionary(Path targetMapFile, Path posDictFile, Path dictFile)Create aUnknownDictionaryfrom an external resource path.UnknownDictionary(BinaryDictionary.ResourceScheme scheme, String resourcePath)Deprecated, for removal: This API element is subject to removal in a future version.replaced byUnknownDictionary(Path, Path, Path)for files andUnknownDictionary(URL, URL, URL)for classpath/module resources
-
Method Summary
All Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description CharacterDefinitiongetCharacterDefinition()static UnknownDictionarygetInstance()Dictionary.Morpheme[]getMorphemes(int wordId, char[] surfaceForm, int off, int len)Get the morphemes of specified word (e.g.StringgetReading(int wordId)Get the reading of specified word (mainly used for Hanja to Hangul conversion).-
Methods inherited from class org.apache.lucene.analysis.ko.dict.BinaryDictionary
getLeftId, getLeftPOS, getPOSType, getResource, getRightId, getRightPOS, getWordCost, lookupWordIds
-
-
-
-
Constructor Detail
-
UnknownDictionary
@Deprecated(forRemoval=true, since="9.1") public UnknownDictionary(BinaryDictionary.ResourceScheme scheme, String resourcePath) throws IOException
Deprecated, for removal: This API element is subject to removal in a future version.replaced byUnknownDictionary(Path, Path, Path)for files andUnknownDictionary(URL, URL, URL)for classpath/module resources- Parameters:
scheme- scheme for loading resources (FILE or CLASSPATH).resourcePath- where to load resources from; a path, including the file base name without extension; this is used to match multiple files with the same base name.- Throws:
IOException
-
UnknownDictionary
public UnknownDictionary(Path targetMapFile, Path posDictFile, Path dictFile) throws IOException
Create aUnknownDictionaryfrom an external resource path.- Parameters:
targetMapFile- where to load target map resourceposDictFile- where to load POS dictionary resourcedictFile- where to load dictionary entries resource- Throws:
IOException- if resource was not found or broken
-
UnknownDictionary
public UnknownDictionary(URL targetMapUrl, URL posDictUrl, URL dictUrl) throws IOException
Create aUnknownDictionaryfrom an external resource URL (e.g. from Classpath withClassLoader.getResource(String)).- Parameters:
targetMapUrl- where to load target map resourceposDictUrl- where to load POS dictionary resourcedictUrl- where to load dictionary entries resource- Throws:
IOException- if resource was not found or broken
-
-
Method Detail
-
getCharacterDefinition
public CharacterDefinition getCharacterDefinition()
-
getInstance
public static UnknownDictionary getInstance()
-
getReading
public String getReading(int wordId)
Description copied from interface:DictionaryGet the reading of specified word (mainly used for Hanja to Hangul conversion).- Specified by:
getReadingin interfaceDictionary- Overrides:
getReadingin classBinaryDictionary
-
getMorphemes
public Dictionary.Morpheme[] getMorphemes(int wordId, char[] surfaceForm, int off, int len)
Description copied from interface:DictionaryGet the morphemes of specified word (e.g. 가깝으나: 가깝 + 으나).- Specified by:
getMorphemesin interfaceDictionary- Overrides:
getMorphemesin classBinaryDictionary
-
-