All Classes and Interfaces (Lucene Analyzers for Japanese 10.3.1.0 API)

Class

Description

AlphaNumWordFilter

Token filter that concatenates adjacent alphanumeric and numeric tokens.

BufferedCharFilter

Abstract base class for character filters that buffer input before processing.

CharTypeFilter

Token filter that accepts tokens based on character type criteria.

ConcatenationFilter

Abstract base class for token filters that concatenate adjacent tokens.

FlexiblePorterStemFilter

Token filter that applies the Porter stemming algorithm with configurable steps.

FlexiblePorterStemmer

Stemmer, implementing the Porter Stemming Algorithm The Stemmer class transforms a word into its root form.

IterationMarkCharFilter

Character filter that expands Japanese iteration marks (odoriji).

KanjiNumberFilter

Normalizes Japanese numbers

KanjiNumberFilter.NumberBuffer

Buffer that holds a Japanese number string and a position index used as a parsed-to marker

NumberConcatenationFilter

A token filter that concatenates tokens containing only numeric characters (digits).

PatternConcatenationFilter

A token filter that uses regular expression patterns to determine token concatenation behavior.

PosConcatenationFilter

A token filter that determines concatenation behavior based on part-of-speech (POS) tags.

PosConcatenationFilter.PartOfSpeechSupplier

Functional interface that supplies part-of-speech (POS) tag information for the current token.

ProlongedSoundMarkCharFilter

A character filter that normalizes various dash and hyphen characters to Japanese prolonged sound marks when they appear after Hiragana, Katakana, or Katakana phonetic extension characters.

ReloadableKeywordMarkerFilter

A keyword marker filter that can dynamically reload its keyword set from a file.

ReloadableStopFilter

A stop word filter that can dynamically reload its stop word set from a file.

StopTokenFilter

Abstract base class for stop token filters that match tokens against a word list.

StopTokenPrefixFilter

A stop token filter that removes tokens beginning with any of the specified prefix words.

StopTokenSuffixFilter

A stop token filter that removes tokens ending with any of the specified suffix words.