Class BinaryDictionary
- java.lang.Object
-
- org.apache.lucene.analysis.ko.dict.BinaryDictionary
-
- All Implemented Interfaces:
Dictionary
- Direct Known Subclasses:
TokenInfoDictionary,UnknownDictionary
public abstract class BinaryDictionary extends Object implements Dictionary
Base class for a binary-encoded in-memory dictionary.
-
-
Nested Class Summary
Nested Classes Modifier and Type Class Description static classBinaryDictionary.ResourceSchemeDeprecated, for removal: This API element is subject to removal in a future version.-
Nested classes/interfaces inherited from interface org.apache.lucene.analysis.ko.dict.Dictionary
Dictionary.Morpheme
-
-
Field Summary
Fields Modifier and Type Field Description static StringDICT_FILENAME_SUFFIXstatic StringDICT_HEADERstatic intHAS_READINGflag that the entry has reading data.static intHAS_SINGLE_POSflag that the entry has a single part of speech (leftPOS)static StringPOSDICT_FILENAME_SUFFIXstatic StringPOSDICT_HEADERstatic StringTARGETMAP_FILENAME_SUFFIXstatic StringTARGETMAP_HEADERstatic intVERSION
-
Constructor Summary
Constructors Modifier Constructor Description protectedBinaryDictionary(IOSupplier<InputStream> targetMapResource, IOSupplier<InputStream> posResource, IOSupplier<InputStream> dictResource)
-
Method Summary
All Methods Static Methods Instance Methods Concrete Methods Deprecated Methods Modifier and Type Method Description intgetLeftId(int wordId)Get left id of specified wordPOS.TaggetLeftPOS(int wordId)Get the leftPOS.Tagof specfied word.Dictionary.Morpheme[]getMorphemes(int wordId, char[] surfaceForm, int off, int len)Get the morphemes of specified word (e.g.POS.TypegetPOSType(int wordId)Get thePOS.Typeof specified word (morpheme, compound, inflect or pre-analysis)StringgetReading(int wordId)Get the reading of specified word (mainly used for Hanja to Hangul conversion).static InputStreamgetResource(BinaryDictionary.ResourceScheme scheme, String path)Deprecated, for removal: This API element is subject to removal in a future version.intgetRightId(int wordId)Get right id of specified wordPOS.TaggetRightPOS(int wordId)Get the rightPOS.Tagof specfied word.intgetWordCost(int wordId)Get word cost of specified wordvoidlookupWordIds(int sourceId, IntsRef ref)
-
-
-
Field Detail
-
TARGETMAP_FILENAME_SUFFIX
public static final String TARGETMAP_FILENAME_SUFFIX
- See Also:
- Constant Field Values
-
DICT_FILENAME_SUFFIX
public static final String DICT_FILENAME_SUFFIX
- See Also:
- Constant Field Values
-
POSDICT_FILENAME_SUFFIX
public static final String POSDICT_FILENAME_SUFFIX
- See Also:
- Constant Field Values
-
DICT_HEADER
public static final String DICT_HEADER
- See Also:
- Constant Field Values
-
TARGETMAP_HEADER
public static final String TARGETMAP_HEADER
- See Also:
- Constant Field Values
-
POSDICT_HEADER
public static final String POSDICT_HEADER
- See Also:
- Constant Field Values
-
VERSION
public static final int VERSION
- See Also:
- Constant Field Values
-
HAS_SINGLE_POS
public static final int HAS_SINGLE_POS
flag that the entry has a single part of speech (leftPOS)- See Also:
- Constant Field Values
-
HAS_READING
public static final int HAS_READING
flag that the entry has reading data. otherwise reading is surface form- See Also:
- Constant Field Values
-
-
Constructor Detail
-
BinaryDictionary
protected BinaryDictionary(IOSupplier<InputStream> targetMapResource, IOSupplier<InputStream> posResource, IOSupplier<InputStream> dictResource) throws IOException
- Throws:
IOException
-
-
Method Detail
-
getResource
@Deprecated(forRemoval=true, since="9.1") public static final InputStream getResource(BinaryDictionary.ResourceScheme scheme, String path) throws IOException
Deprecated, for removal: This API element is subject to removal in a future version.- Throws:
IOException
-
lookupWordIds
public void lookupWordIds(int sourceId, IntsRef ref)
-
getLeftId
public int getLeftId(int wordId)
Description copied from interface:DictionaryGet left id of specified word- Specified by:
getLeftIdin interfaceDictionary
-
getRightId
public int getRightId(int wordId)
Description copied from interface:DictionaryGet right id of specified word- Specified by:
getRightIdin interfaceDictionary
-
getWordCost
public int getWordCost(int wordId)
Description copied from interface:DictionaryGet word cost of specified word- Specified by:
getWordCostin interfaceDictionary
-
getPOSType
public POS.Type getPOSType(int wordId)
Description copied from interface:DictionaryGet thePOS.Typeof specified word (morpheme, compound, inflect or pre-analysis)- Specified by:
getPOSTypein interfaceDictionary
-
getLeftPOS
public POS.Tag getLeftPOS(int wordId)
Description copied from interface:DictionaryGet the leftPOS.Tagof specfied word.For
POS.Type.MORPHEMEandPOS.Type.COMPOUNDthe left and right POS are the same.- Specified by:
getLeftPOSin interfaceDictionary
-
getRightPOS
public POS.Tag getRightPOS(int wordId)
Description copied from interface:DictionaryGet the rightPOS.Tagof specfied word.For
POS.Type.MORPHEMEandPOS.Type.COMPOUNDthe left and right POS are the same.- Specified by:
getRightPOSin interfaceDictionary
-
getReading
public String getReading(int wordId)
Description copied from interface:DictionaryGet the reading of specified word (mainly used for Hanja to Hangul conversion).- Specified by:
getReadingin interfaceDictionary
-
getMorphemes
public Dictionary.Morpheme[] getMorphemes(int wordId, char[] surfaceForm, int off, int len)
Description copied from interface:DictionaryGet the morphemes of specified word (e.g. 가깝으나: 가깝 + 으나).- Specified by:
getMorphemesin interfaceDictionary
-
-