Package org.apache.lucene.misc
Miscellaneous index tools.
Classes
Class: Description
GetTermInfo: Utility to get document frequency and total number of occurrences (sum of the tf for each doc) of a term.
HighFreqTerms: Extracts the top n most frequent terms (by document frequency) from an existing Lucene index and reports their document frequency.
HighFreqTerms.DocFreqComparator: Compares terms by docTermFreq.
HighFreqTerms.TotalTermFreqComparator: Compares terms by totalTermFreq.
IndexMergeTool: Merges indices specified on the command line into the index specified as the first command line argument.
SweetSpotSimilarity: A similarity with a lengthNorm that provides for a "plateau" of equally good lengths, and tf helper functions.
TermStats: Holder for a term along with its statistics (TermStats.docFreq and TermStats.totalTermFreq).
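
To make the two statistics mentioned above concrete, here is a minimal, self-contained sketch of how document frequency and total term frequency differ, and how a "top n by document frequency" ranking could be computed. It deliberately does not use the Lucene API: the class TopTermsSketch, the holder Stats, and the method topByDocFreq are hypothetical names invented for this illustration, and documents are modeled as plain token lists rather than an index.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Illustration only: hypothetical names, no Lucene classes involved.
public class TopTermsSketch {

    /** Hypothetical holder for a term and its two statistics. */
    public static final class Stats {
        public final String term;
        public int docFreq;        // number of documents containing the term
        public long totalTermFreq; // sum of the term's occurrences over all documents

        Stats(String term) {
            this.term = term;
        }
    }

    /** Returns the n terms with the highest document frequency (ties broken by term text). */
    public static List<Stats> topByDocFreq(List<List<String>> docs, int n) {
        Map<String, Stats> byTerm = new HashMap<>();
        for (List<String> doc : docs) {
            Set<String> seenInDoc = new HashSet<>();
            for (String token : doc) {
                Stats s = byTerm.computeIfAbsent(token, Stats::new);
                s.totalTermFreq++;          // every occurrence counts
                if (seenInDoc.add(token)) { // first occurrence in this document
                    s.docFreq++;
                }
            }
        }
        List<Stats> all = new ArrayList<>(byTerm.values());
        all.sort(Comparator.comparingInt((Stats s) -> s.docFreq).reversed()
                .thenComparing((Stats s) -> s.term));
        return new ArrayList<>(all.subList(0, Math.min(n, all.size())));
    }

    public static void main(String[] args) {
        List<List<String>> docs = List.of(
                List.of("lucene", "index", "lucene"),
                List.of("lucene", "search"),
                List.of("index", "merge"),
                List.of("lucene"));
        for (Stats s : topByDocFreq(docs, 2)) {
            System.out.println(s.term + " docFreq=" + s.docFreq
                    + " totalTermFreq=" + s.totalTermFreq);
        }
    }
}
```

In this toy data, a term that appears twice in one document adds two to its total term frequency but only one to its document frequency, which is the distinction the two comparators in this package rank on.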