All Classes and Interfaces

Class
Description
Token filter that concatenates adjacent alphanumeric and numeric tokens.
Abstract base class for character filters that buffer input before processing.
Token filter that accepts tokens based on character type criteria.
Abstract base class for token filters that concatenate adjacent tokens.
Token filter that applies the Porter stemming algorithm with configurable steps.
Stemmer implementing the Porter Stemming Algorithm. The Stemmer class transforms a word into its root form.
Character filter that expands Japanese iteration marks (odoriji).
Normalizes Japanese numbers.
Buffer that holds a Japanese number string and a position index used as a parsed-to marker.
A token filter that concatenates tokens containing only numeric characters (digits).
A token filter that uses regular expression patterns to determine token concatenation behavior.
A token filter that determines concatenation behavior based on part-of-speech (POS) tags.
Functional interface that supplies part-of-speech (POS) tag information for the current token.
A character filter that normalizes various dash and hyphen characters to Japanese prolonged sound marks when they appear after Hiragana, Katakana, or Katakana phonetic extension characters.
A keyword marker filter that can dynamically reload its keyword set from a file.
A stop word filter that can dynamically reload its stop word set from a file.
Abstract base class for stop token filters that match tokens against a word list.
A stop token filter that removes tokens beginning with any of the specified prefix words.
A stop token filter that removes tokens ending with any of the specified suffix words.
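The prefix- and suffix-based stop filters described in the last two entries can be illustrated without any analysis framework. The following is a minimal sketch of the matching rule only, assuming tokens arrive as plain strings; the class name `AffixStopSketch` and both method names are hypothetical and are not part of this library's API.

```java
import java.util.List;
import java.util.stream.Collectors;

// Hypothetical illustration; not the library's actual filter classes.
public class AffixStopSketch {

    // Keeps only the tokens that do not begin with any of the given prefix words.
    static List<String> removeByPrefix(List<String> tokens, List<String> prefixes) {
        return tokens.stream()
                .filter(token -> prefixes.stream().noneMatch(token::startsWith))
                .collect(Collectors.toList());
    }

    // Keeps only the tokens that do not end with any of the given suffix words.
    static List<String> removeBySuffix(List<String> tokens, List<String> suffixes) {
        return tokens.stream()
                .filter(token -> suffixes.stream().noneMatch(token::endsWith))
                .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        List<String> tokens = List.of("preview", "view", "viewing", "present");
        System.out.println(removeByPrefix(tokens, List.of("pre")));
        System.out.println(removeBySuffix(tokens, List.of("ing")));
    }
}
```

A real token filter would operate on a token stream and may also need to adjust position increments when it drops a token; this sketch omits that and shows only the string comparison.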