CzechAnalyzer (The Adobe Experience Manager SDK 2020.6.3694.20200609T225153Z-200604)

java.lang.Object
- org.apache.lucene.analysis.Analyzer
  - org.apache.lucene.analysis.util.StopwordAnalyzerBase
    - org.apache.lucene.analysis.cz.CzechAnalyzer

All Implemented Interfaces:

Closeable, AutoCloseable
```
public final class CzechAnalyzer
extends StopwordAnalyzerBase
```
Analyzer for the Czech language.
Supports an external list of stopwords (words that will not be indexed at all). A default set of stopwords is used unless an alternative list is specified.

You must specify the required Version compatibility when creating a CzechAnalyzer. The version you pass selects the following behavior:
- As of 3.1, words are stemmed with CzechStemFilter
- As of 2.9, StopFilter preserves position increments
- As of 2.4, tokens incorrectly identified as acronyms are corrected (see LUCENE-1068)
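
To make the phrase "preserves position increments" concrete, the following is a minimal plain-Java model of the idea. It is not Lucene's `StopFilter` implementation: the class `PositionIncrementSketch`, its `Token` type, and the two-word stopword set are all hypothetical names chosen for illustration. Each removed stopword widens the gap recorded on the next surviving token, so phrase and proximity queries can still see that words were skipped.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

public class PositionIncrementSketch {

    /** A surviving term plus its distance from the previous surviving term. */
    static final class Token {
        final String term;
        final int positionIncrement;

        Token(String term, int positionIncrement) {
            this.term = term;
            this.positionIncrement = positionIncrement;
        }

        @Override
        public String toString() {
            return term + "(+" + positionIncrement + ")";
        }
    }

    /** Drops stopwords and adds each skipped position to the next kept token. */
    static List<Token> filter(List<String> terms, Set<String> stopwords) {
        List<Token> kept = new ArrayList<>();
        int skipped = 0;
        for (String term : terms) {
            if (stopwords.contains(term)) {
                skipped++; // remember the gap instead of discarding it
            } else {
                kept.add(new Token(term, 1 + skipped));
                skipped = 0;
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        // Hypothetical stopword set; diacritics are omitted to keep the sketch ASCII-only.
        Set<String> stopwords = new HashSet<>(Arrays.asList("je", "na"));
        List<String> terms = Arrays.asList("praha", "je", "na", "vltave");
        System.out.println(filter(terms, stopwords)); // prints [praha(+1), vltave(+3)]
    }
}
```

A filter that did not preserve increments would report `vltave(+1)` instead, making the two kept terms look adjacent.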

Nested Class Summary
- Nested classes/interfaces inherited from class org.apache.lucene.analysis.Analyzer
  Analyzer.GlobalReuseStrategy, Analyzer.PerFieldReuseStrategy, Analyzer.ReuseStrategy, Analyzer.TokenStreamComponents

Field Summary

Fields
Modifier, Type, Field and Description
`static String DEFAULT_STOPWORD_FILE` File containing default Czech stopwords.
- Fields inherited from class org.apache.lucene.analysis.Analyzer
  GLOBAL_REUSE_STRATEGY, PER_FIELD_REUSE_STRATEGY

Constructor Summary

Constructors
Constructor and Description
`CzechAnalyzer(Version matchVersion)` Builds an analyzer with the default stop words (`getDefaultStopSet()`).
`CzechAnalyzer(Version matchVersion, CharArraySet stopwords)` Builds an analyzer with the given stop words.
`CzechAnalyzer(Version matchVersion, CharArraySet stopwords, CharArraySet stemExclusionTable)` Builds an analyzer with the given stop words and a set of words to be excluded from the `CzechStemFilter`.

Method Summary

Modifier, Type, Method and Description
`static CharArraySet getDefaultStopSet()` Returns a set of default Czech stopwords.
- Methods inherited from class org.apache.lucene.analysis.util.StopwordAnalyzerBase
  getStopwordSet
- Methods inherited from class org.apache.lucene.analysis.Analyzer
  close, getOffsetGap, getPositionIncrementGap, getReuseStrategy, tokenStream, tokenStream
- Methods inherited from class java.lang.Object
  equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

- Field Detail
  - DEFAULT_STOPWORD_FILE
```
public static final String DEFAULT_STOPWORD_FILE
```
    File containing default Czech stopwords.
    
    See Also:
    
    Constant Field Values
- Constructor Detail
  - CzechAnalyzer
```
public CzechAnalyzer(Version matchVersion)
```
    Builds an analyzer with the default stop words (getDefaultStopSet()).
    
    Parameters:
    
    matchVersion - Lucene version to match; see the version notes above
  - CzechAnalyzer
```
public CzechAnalyzer(Version matchVersion,
                     CharArraySet stopwords)
```
    Builds an analyzer with the given stop words.
    
    Parameters:
    
    matchVersion - Lucene version to match; see the version notes above
    
    stopwords - a stopword set
  - CzechAnalyzer
```
public CzechAnalyzer(Version matchVersion,
                     CharArraySet stopwords,
                     CharArraySet stemExclusionTable)
```
    Builds an analyzer with the given stop words and a set of words to be excluded from the CzechStemFilter.
    
    Parameters:
    
    matchVersion - Lucene version to match; see the version notes above
    
    stopwords - a stopword set
    
    stemExclusionTable - a stemming exclusion set
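
The three constructors can be compared side by side with a short sketch. It assumes a Lucene 4.7 classpath (hence `Version.LUCENE_47`; substitute the constant that matches your installation). The class name `CzechAnalyzerConstructors`, the helper `analyze`, the field name `"body"`, and the sample words are hypothetical and chosen only for illustration.

```java
import java.io.IOException;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

import org.apache.lucene.analysis.Analyzer;
import org.apache.lucene.analysis.TokenStream;
import org.apache.lucene.analysis.cz.CzechAnalyzer;
import org.apache.lucene.analysis.tokenattributes.CharTermAttribute;
import org.apache.lucene.analysis.util.CharArraySet;
import org.apache.lucene.util.Version;

public class CzechAnalyzerConstructors {

    // Assumption: Lucene 4.7 is on the classpath.
    static final Version MATCH_VERSION = Version.LUCENE_47;

    /** Runs the analyzer over the text and collects the emitted terms. */
    static List<String> analyze(Analyzer analyzer, String text) {
        List<String> terms = new ArrayList<>();
        // "body" is an arbitrary field name chosen for this sketch.
        try (TokenStream stream = analyzer.tokenStream("body", new StringReader(text))) {
            CharTermAttribute term = stream.addAttribute(CharTermAttribute.class);
            stream.reset();
            while (stream.incrementToken()) {
                terms.add(term.toString());
            }
            stream.end();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return terms;
    }

    public static void main(String[] args) {
        String text = "Knihovna v Praze";

        // 1. Default Czech stopwords.
        Analyzer defaults = new CzechAnalyzer(MATCH_VERSION);

        // 2. Caller-supplied stopwords, stored in a case-insensitive set.
        CharArraySet stopwords = new CharArraySet(MATCH_VERSION, Arrays.asList("v"), true);
        Analyzer customStops = new CzechAnalyzer(MATCH_VERSION, stopwords);

        // 3. Same stopwords, plus terms that should bypass CzechStemFilter.
        CharArraySet noStem = new CharArraySet(MATCH_VERSION, Arrays.asList("praze"), true);
        Analyzer withExclusions = new CzechAnalyzer(MATCH_VERSION, stopwords, noStem);

        System.out.println("default stopwords: " + analyze(defaults, text));
        System.out.println("custom stopwords:  " + analyze(customStops, text));
        System.out.println("stem exclusions:   " + analyze(withExclusions, text));
    }
}
```

With the third analyzer, `praze` should come through lowercased but unstemmed, since terms in the exclusion set are not passed through the stemmer. The exact stems produced by the first two analyzers depend on `CzechStemFilter` and are not shown here.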
- Method Detail
  - getDefaultStopSet
```
public static final CharArraySet getDefaultStopSet()
```
    Returns a set of default Czech stopwords.
    
    Returns:
    
    a set of default Czech stopwords
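
A common use of `getDefaultStopSet()` is to extend the defaults rather than replace them. The sketch below assumes a Lucene 4.7 classpath; the class `CzechStopwordsSketch`, the helper `extendedStopSet`, and the extra words are hypothetical. It copies the default set before adding to it, on the cautious assumption that the returned set is shared and should not be modified in place.

```java
import org.apache.lucene.analysis.cz.CzechAnalyzer;
import org.apache.lucene.analysis.util.CharArraySet;
import org.apache.lucene.util.Version;

public class CzechStopwordsSketch {

    // Assumption: Lucene 4.7 is on the classpath.
    static final Version MATCH_VERSION = Version.LUCENE_47;

    /** Copies the default Czech stopwords and adds caller-supplied extras. */
    static CharArraySet extendedStopSet(String... extraWords) {
        // Copy first so the default set itself is never touched.
        CharArraySet merged =
                new CharArraySet(MATCH_VERSION, CzechAnalyzer.getDefaultStopSet(), true);
        for (String word : extraWords) {
            merged.add(word);
        }
        return merged;
    }

    public static void main(String[] args) {
        // Hypothetical domain-specific noise words.
        CharArraySet stopwords = extendedStopSet("lorem", "ipsum");
        CzechAnalyzer analyzer = new CzechAnalyzer(MATCH_VERSION, stopwords);

        System.out.println("default size:  " + CzechAnalyzer.getDefaultStopSet().size());
        System.out.println("extended size: " + analyzer.getStopwordSet().size());
        analyzer.close();
    }
}
```

`getStopwordSet()` is the accessor inherited from `StopwordAnalyzerBase`, so it reports the set the analyzer was actually built with.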

Copyright © 2010 - 2020 Adobe. All Rights Reserved