Interface | Description |
---|---|
StandardTokenizerInterface |
Internal interface for supporting versioned grammars.
|
Class | Description |
---|---|
AbstractAnalyzer | |
DelimiterAnalyzer | |
DelimiterTokenizingOptions |
Simple tokenizer based on a specified delimiter (rather than whitespace).
|
NonTokenizingAnalyzer |
Analyzer that does *not* tokenize the input.
|
NonTokenizingOptions | |
NonTokenizingOptions.OptionsBuilder | |
NoOpAnalyzer |
Default noOp tokenizer.
|
StandardAnalyzer | |
StandardTokenizerImpl |
This class implements Word Break rules from the Unicode Text Segmentation
algorithm, as specified in
Unicode Standard Annex #29.
|
StandardTokenizerOptions |
Various options for controlling tokenization and enabling
or disabling features
|
StandardTokenizerOptions.OptionsBuilder |
Enum | Description |
---|---|
StandardAnalyzer.TokenType |
Copyright © 2019 The Apache Software Foundation