Package org.apache.solr.analysis

Interface Summary
CharFilterFactory  
MultiTermAwareComponent Add to any analysis factory component to allow returning an analysis component factory for use with partial terms in prefix queries, wildcard queries, range query endpoints, regex queries, etc.
TokenFilterFactory A TokenFilterFactory creates a TokenFilter to transform one TokenStream into another.
TokenizerFactory A TokenizerFactory breaks up a stream of characters into tokens.
 

Class Summary
ArabicLetterTokenizerFactory Deprecated. (3.1) Use StandardTokenizerFactory instead.
ArabicNormalizationFilterFactory Factory for ArabicNormalizationFilter.
ArabicStemFilterFactory Factory for ArabicStemFilter.
ASCIIFoldingFilterFactory Factory for ASCIIFoldingFilter.
BaseCharFilterFactory  
BaseTokenFilterFactory Simple abstract implementation that handles init arg processing.
BaseTokenizerFactory Simple abstract implementation that handles init arg processing.
BeiderMorseFilterFactory Factory for BeiderMorseFilter.
BrazilianStemFilterFactory Factory for BrazilianStemFilter.
BufferedTokenStream Deprecated. This class does not support custom attributes.
BulgarianStemFilterFactory Factory for BulgarianStemFilter.
CapitalizationFilterFactory A filter to apply normal capitalization rules to Tokens.
ChineseFilterFactory Deprecated. Use StopFilterFactory instead.
ChineseTokenizerFactory Deprecated. Use StandardTokenizerFactory instead.
CJKBigramFilterFactory Factory for CJKBigramFilter.
CJKTokenizerFactory Deprecated.  
CJKWidthFilterFactory Factory for CJKWidthFilter.
ClassicFilterFactory Factory for ClassicFilter.
ClassicTokenizerFactory Factory for ClassicTokenizer.
CollationKeyFilterFactory Factory for CollationKeyFilter.
CommonGramsFilter Construct bigrams for frequently occurring terms while indexing.
CommonGramsFilterFactory Constructs a CommonGramsFilter.
CommonGramsQueryFilter Wrap a CommonGramsFilter optimizing phrase queries by only returning single words when they are not a member of a bigram.
CommonGramsQueryFilterFactory Construct CommonGramsQueryFilter.
CzechStemFilterFactory Factory for CzechStemFilter.
DelimitedPayloadTokenFilterFactory Factory for DelimitedPayloadTokenFilter.
DictionaryCompoundWordTokenFilterFactory Factory for DictionaryCompoundWordTokenFilter.
DoubleMetaphoneFilter Deprecated. Use DoubleMetaphoneFilter instead
DoubleMetaphoneFilterFactory Factory for DoubleMetaphoneFilter.
DutchStemFilterFactory Deprecated. Use SnowballPorterFilterFactory with "Dutch" instead, which has the same functionality.
EdgeNGramFilterFactory Creates new instances of EdgeNGramTokenFilter.
EdgeNGramTokenizerFactory Creates new instances of EdgeNGramTokenizer.
ElisionFilterFactory Factory for ElisionFilter.
EnglishMinimalStemFilterFactory Factory for EnglishMinimalStemFilter.
EnglishPorterFilterFactory Deprecated. Use SnowballPorterFilterFactory with language="English" instead
EnglishPossessiveFilterFactory Factory for EnglishPossessiveFilter.
FinnishLightStemFilterFactory Factory for FinnishLightStemFilter.
FrenchLightStemFilterFactory Factory for FrenchLightStemFilter.
FrenchMinimalStemFilterFactory Factory for FrenchMinimalStemFilter.
FrenchStemFilterFactory Deprecated. Use SnowballPorterFilterFactory with "French" instead, which has the same functionality.
GalicianMinimalStemFilterFactory Factory for GalicianMinimalStemFilter.
GalicianStemFilterFactory Factory for GalicianStemFilter.
GermanLightStemFilterFactory Factory for GermanLightStemFilter.
GermanMinimalStemFilterFactory Factory for GermanMinimalStemFilter.
GermanNormalizationFilterFactory Factory for GermanNormalizationFilter.
GermanStemFilterFactory Factory for GermanStemFilter.
GreekLowerCaseFilterFactory Factory for GreekLowerCaseFilter.
GreekStemFilterFactory Factory for GreekStemFilter.
HindiNormalizationFilterFactory Factory for HindiNormalizationFilter.
HindiStemFilterFactory Factory for HindiStemFilter.
HTMLStripCharFilterFactory Factory for HTMLStripCharFilter.
HungarianLightStemFilterFactory Factory for HungarianLightStemFilter.
HunspellStemFilterFactory TokenFilterFactory that creates instances of HunspellStemFilter.
HyphenatedWordsFilter When the plain text is extracted from documents, we will often have many words hyphenated and broken into two lines.
HyphenatedWordsFilterFactory Factory for HyphenatedWordsFilter.
HyphenationCompoundWordTokenFilterFactory Factory for HyphenationCompoundWordTokenFilter.
IndicNormalizationFilterFactory Factory for IndicNormalizationFilter.
IndonesianStemFilterFactory Factory for IndonesianStemFilter.
IrishLowerCaseFilterFactory Factory for IrishLowerCaseFilter.
ISOLatin1AccentFilterFactory Deprecated. Use ASCIIFoldingFilterFactory instead.
ItalianLightStemFilterFactory Factory for ItalianLightStemFilter.
JapaneseBaseFormFilterFactory Factory for JapaneseBaseFormFilter.
JapaneseKatakanaStemFilterFactory Factory for JapaneseKatakanaStemFilterFactory.
JapanesePartOfSpeechStopFilterFactory Factory for JapanesePartOfSpeechStopFilter.
JapaneseReadingFormFilterFactory Factory for JapaneseReadingFormFilter.
JapaneseTokenizerFactory Factory for JapaneseTokenizer.
KeepWordFilter A TokenFilter that only keeps tokens with text contained in the required words.
KeepWordFilterFactory Factory for KeepWordFilter.
KeywordMarkerFilterFactory Factory for KeywordMarkerFilter.
KeywordTokenizerFactory Factory for KeywordTokenizer.
KStemFilterFactory Factory for KStemFilter
LatvianStemFilterFactory Factory for LatvianStemFilter.
LegacyHTMLStripCharFilter Deprecated. use HTMLStripCharFilter
LegacyHTMLStripCharFilterFactory Deprecated. use HTMLStripCharFilterFactory
LengthFilterFactory Factory for LengthFilter.
LetterTokenizerFactory Factory for LetterTokenizer.
LimitTokenCountFilterFactory Factory for LimitTokenCountFilter.
LowerCaseFilterFactory Factory for LowerCaseFilter.
LowerCaseTokenizerFactory Factory for LowerCaseTokenizer.
MappingCharFilterFactory Factory for MappingCharFilter.
NGramFilterFactory Factory for NGramTokenFilter.
NGramTokenizerFactory Factory for NGramTokenizer.
NorwegianLightStemFilterFactory Factory for NorwegianLightStemFilter.
NorwegianMinimalStemFilterFactory Factory for NorwegianMinimalStemFilter.
NumericPayloadTokenFilterFactory Factory for NumericPayloadTokenFilter.
PathHierarchyTokenizerFactory Factory for PathHierarchyTokenizer.
PatternReplaceCharFilter CharFilter that uses a regular expression for the target of replace string.
PatternReplaceCharFilterFactory Factory for PatternReplaceCharFilter.
PatternReplaceFilter A TokenFilter which applies a Pattern to each token in the stream, replacing match occurances with the specified replacement string.
PatternReplaceFilterFactory Factory for PatternReplaceFilter.
PatternTokenizer This tokenizer uses regex pattern matching to construct distinct tokens for the input stream.
PatternTokenizerFactory Factory for PatternTokenizer.
PersianCharFilterFactory Factory for PersianCharFilter.
PersianNormalizationFilterFactory Factory for PersianNormalizationFilter.
PhoneticFilter Deprecated. Use PhoneticFilter instead.
PhoneticFilterFactory Factory for PhoneticFilter.
PorterStemFilterFactory Factory for PorterStemFilter.
PortugueseLightStemFilterFactory Factory for PortugueseLightStemFilter.
PortugueseMinimalStemFilterFactory Factory for PortugueseMinimalStemFilter.
PortugueseStemFilterFactory Factory for PortugueseStemFilter.
PositionFilterFactory Factory for PositionFilter.
RemoveDuplicatesTokenFilter A TokenFilter which filters out Tokens at the same position and Term text as the previous token in the stream.
RemoveDuplicatesTokenFilterFactory Factory for RemoveDuplicatesTokenFilter.
ReversedWildcardFilter This class produces a special form of reversed tokens, suitable for better handling of leading wildcards.
ReversedWildcardFilterFactory Factory for ReversedWildcardFilter-s.
ReverseStringFilterFactory Factory for ReverseStringFilter.
RussianLetterTokenizerFactory Deprecated. Use StandardTokenizerFactory instead.
RussianLightStemFilterFactory Factory for RussianLightStemFilter.
RussianLowerCaseFilterFactory Deprecated. Use LowerCaseFilterFactory instead which has the same functionality.
RussianStemFilterFactory Deprecated. Use SnowballPorterFilterFactory with "Russian" instead, which has the same functionality.
ShingleFilterFactory Factory for ShingleFilter.
SnowballPorterFilterFactory Factory for SnowballFilter, with configurable language
SolrAnalyzer  
SolrAnalyzer.TokenStreamInfo  
SpanishLightStemFilterFactory Factory for SpanishLightStemFilter.
StandardFilterFactory Factory for StandardFilter.
StandardTokenizerFactory Factory for StandardTokenizer.
StemmerOverrideFilterFactory Factory for StemmerOverrideFilter.
StopFilterFactory Factory for StopFilter.
SwedishLightStemFilterFactory Factory for SwedishLightStemFilter.
SynonymFilterFactory Factory for SynonymFilter.
ThaiWordFilterFactory Factory for ThaiWordFilter.
TokenizerChain  
TokenOffsetPayloadTokenFilterFactory Factory for TokenOffsetPayloadTokenFilter.
TrieTokenizerFactory Tokenizer for trie fields.
TrimFilter Trims leading and trailing whitespace from Tokens in the stream.
TrimFilterFactory Factory for TrimFilter.
TurkishLowerCaseFilterFactory Factory for TurkishLowerCaseFilter.
TypeAsPayloadTokenFilterFactory Factory for TypeAsPayloadTokenFilter.
TypeTokenFilterFactory Factory class for TypeTokenFilter.
UAX29URLEmailTokenizerFactory Factory for UAX29URLEmailTokenizer.
WhitespaceTokenizerFactory Factory for WhitespaceTokenizer.
WikipediaTokenizerFactory Factory for WikipediaTokenizer.
WordDelimiterFilterFactory Factory for WordDelimiterFilter.
WordDelimiterIterator A BreakIterator-like API for iterating over subwords in text, according to WordDelimiterFilter rules.