Package org.apache.lucene.classification.document
Uses already seen data (the indexed documents) to classify new documents.
Currently contains a (simplistic) Naive Bayes classifier and a k-Nearest Neighbor classifier.
Interface Summary
DocumentClassifier<T>: A classifier (see http://en.wikipedia.org/wiki/Classifier_(mathematics)) which assigns classes of type T to Documents.
Class Summary
KNearestNeighborDocumentClassifier: A k-Nearest Neighbor Document classifier (see http://en.wikipedia.org/wiki/K-nearest_neighbors) based on MoreLikeThis.
SimpleNaiveBayesDocumentClassifier: A simplistic Lucene based NaiveBayes classifier (see http://en.wikipedia.org/wiki/Naive_Bayes_classifier).
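To illustrate the idea of classifying a new document from already seen ones, here is a minimal plain-Java sketch of k-nearest-neighbor voting. It does not use Lucene and is not how KNearestNeighborDocumentClassifier is implemented: the class KnnSketch, the Labeled type, the regex tokenizer, and the Jaccard term-overlap similarity are all assumptions made for illustration, standing in for an index, an Analyzer, and MoreLikeThis respectively.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

// Hypothetical illustration only; none of these names come from Lucene.
class KnnSketch {

    /** An already seen document: its text plus its known class label. */
    static final class Labeled {
        final String text;
        final String label;

        Labeled(String text, String label) {
            this.text = text;
            this.label = label;
        }
    }

    /** Lower-cases and splits on non-letters; a crude stand-in for an analyzer. */
    static Set<String> terms(String text) {
        Set<String> out = new HashSet<>();
        for (String t : text.toLowerCase(Locale.ROOT).split("[^a-z]+")) {
            if (!t.isEmpty()) {
                out.add(t);
            }
        }
        return out;
    }

    /** Jaccard similarity: shared terms divided by all distinct terms. */
    static double similarity(Set<String> a, Set<String> b) {
        if (a.isEmpty() && b.isEmpty()) {
            return 0.0;
        }
        Set<String> shared = new HashSet<>(a);
        shared.retainAll(b);
        int union = a.size() + b.size() - shared.size();
        return (double) shared.size() / union;
    }

    /** Returns the majority label among the k most similar seen documents, or null if none. */
    static String classify(List<Labeled> seen, String text, int k) {
        Set<String> query = terms(text);
        List<Labeled> ranked = new ArrayList<>(seen);
        // Most similar first; the sort is stable, so ties keep their input order.
        ranked.sort((x, y) -> Double.compare(
                similarity(query, terms(y.text)),
                similarity(query, terms(x.text))));

        Map<String, Integer> votes = new HashMap<>();
        String best = null;
        for (Labeled d : ranked.subList(0, Math.min(k, ranked.size()))) {
            int v = votes.merge(d.label, 1, Integer::sum);
            // Strictly-greater check: on a tied vote the label that got there first wins.
            if (best == null || v > votes.get(best)) {
                best = d.label;
            }
        }
        return best;
    }

    public static void main(String[] args) {
        List<Labeled> seen = Arrays.asList(
                new Labeled("the team won the football match", "sports"),
                new Labeled("the striker scored a late goal", "sports"),
                new Labeled("parliament passed the new budget", "politics"),
                new Labeled("the senate debated the tax bill", "politics"));
        // prints "sports"
        System.out.println(classify(seen, "a late goal decided the football match", 3));
    }
}
```

In this sketch the labeled list plays the role of the indexed documents, and k controls how many neighbors vote. A real index would retrieve the neighbors with a query rather than by scoring every stored document.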