Package org.apache.lucene.classification
Class BM25NBClassifier
java.lang.Object
org.apache.lucene.classification.BM25NBClassifier
- All Implemented Interfaces:
- Classifier<BytesRef>
A classifier approximating naive bayes classifier by using pure queries on BM25.
- WARNING: This API is experimental and might change in incompatible ways in the next release.
- 
Constructor SummaryConstructorsConstructorDescriptionBM25NBClassifier(IndexReader indexReader, Analyzer analyzer, Query query, String classFieldName, String... textFieldNames) Creates a new NaiveBayes classifier.
- 
Method SummaryModifier and TypeMethodDescriptionassignClass(String inputDocument) Assign a class (with score) to the given text StringgetClasses(String text) Get all the classes (sorted by score, descending) assigned to the given text String.getClasses(String text, int max) Get the firstmaxclasses (sorted by score, descending) assigned to the given text String.
- 
Constructor Details- 
BM25NBClassifierpublic BM25NBClassifier(IndexReader indexReader, Analyzer analyzer, Query query, String classFieldName, String... textFieldNames) Creates a new NaiveBayes classifier.- Parameters:
- indexReader- the reader on the index to be used for classification
- analyzer- an- Analyzerused to analyze unseen text
- query- a- Queryto eventually filter the docs used for training the classifier, or- nullif all the indexed docs should be used
- classFieldName- the name of the field used as the output for the classifier NOTE: must not be heavely analyzed as the returned class will be a token indexed for this field
- textFieldNames- the name of the fields used as the inputs for the classifier, NO boosting supported per field
 
 
- 
- 
Method Details- 
assignClassDescription copied from interface:ClassifierAssign a class (with score) to the given text String- Specified by:
- assignClassin interface- Classifier<BytesRef>
- Parameters:
- inputDocument- a String containing text to be classified
- Returns:
- a ClassificationResultholding assigned class of typeTand score
- Throws:
- IOException- If there is a low-level I/O error.
 
- 
getClassesDescription copied from interface:ClassifierGet all the classes (sorted by score, descending) assigned to the given text String.- Specified by:
- getClassesin interface- Classifier<BytesRef>
- Parameters:
- text- a String containing text to be classified
- Returns:
- the whole list of ClassificationResult, the classes and scores. Returnsnullif the classifier can't make lists.
- Throws:
- IOException- If there is a low-level I/O error.
 
- 
getClassesDescription copied from interface:ClassifierGet the firstmaxclasses (sorted by score, descending) assigned to the given text String.- Specified by:
- getClassesin interface- Classifier<BytesRef>
- Parameters:
- text- a String containing text to be classified
- max- the number of return list elements
- Returns:
- the whole list of ClassificationResult, the classes and scores. Cut for "max" number of elements. Returnsnullif the classifier can't make lists.
- Throws:
- IOException- If there is a low-level I/O error.
 
 
-