public class MockTokenizer extends Tokenizer
 This tokenizer is a replacement for WHITESPACE, SIMPLE, and KEYWORD
 tokenizers. If you are writing a component such as a TokenFilter, it's a great idea to test
 it wrapping this tokenizer instead for extra checks. This tokenizer has the following behavior:
 
setEnableChecks(boolean).
   | Modifier and Type | Field and Description | 
|---|---|
| static int | DEFAULT_MAX_TOKEN_LENGTH | 
| static CharacterRunAutomaton | KEYWORDActs Similar to KeywordTokenizer. | 
| static CharacterRunAutomaton | SIMPLEActs like LetterTokenizer. | 
| static CharacterRunAutomaton | WHITESPACEActs Similar to WhitespaceTokenizer | 
DEFAULT_TOKEN_ATTRIBUTE_FACTORY| Constructor and Description | 
|---|
| MockTokenizer() | 
| MockTokenizer(AttributeFactory factory) | 
| MockTokenizer(AttributeFactory factory,
             CharacterRunAutomaton runAutomaton,
             boolean lowerCase) | 
| MockTokenizer(AttributeFactory factory,
             CharacterRunAutomaton runAutomaton,
             boolean lowerCase,
             int maxTokenLength) | 
| MockTokenizer(CharacterRunAutomaton runAutomaton,
             boolean lowerCase) | 
| MockTokenizer(CharacterRunAutomaton runAutomaton,
             boolean lowerCase,
             int maxTokenLength) | 
| Modifier and Type | Method and Description | 
|---|---|
| void | close() | 
| void | end() | 
| boolean | incrementToken() | 
| protected boolean | isTokenChar(int c) | 
| protected int | normalize(int c) | 
| protected int | readChar() | 
| protected int | readCodePoint() | 
| void | reset() | 
| void | setEnableChecks(boolean enableChecks)Toggle consumer workflow checking: if your test consumes tokenstreams normally you
 should leave this enabled. | 
correctOffset, setReaderaddAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, endAttributes, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, removeAllAttributes, restoreState, toStringpublic static final CharacterRunAutomaton WHITESPACE
public static final CharacterRunAutomaton KEYWORD
public static final CharacterRunAutomaton SIMPLE
public static final int DEFAULT_MAX_TOKEN_LENGTH
public MockTokenizer(AttributeFactory factory, CharacterRunAutomaton runAutomaton, boolean lowerCase, int maxTokenLength)
public MockTokenizer(CharacterRunAutomaton runAutomaton, boolean lowerCase, int maxTokenLength)
public MockTokenizer(CharacterRunAutomaton runAutomaton, boolean lowerCase)
public MockTokenizer()
public MockTokenizer(AttributeFactory factory, CharacterRunAutomaton runAutomaton, boolean lowerCase)
public MockTokenizer(AttributeFactory factory)
public final boolean incrementToken()
                             throws IOException
incrementToken in class TokenStreamIOExceptionprotected int readCodePoint()
                     throws IOException
IOExceptionprotected int readChar()
                throws IOException
IOExceptionprotected boolean isTokenChar(int c)
protected int normalize(int c)
public void reset()
           throws IOException
reset in class TokenizerIOExceptionpublic void close()
           throws IOException
close in interface Closeableclose in interface AutoCloseableclose in class TokenizerIOExceptionpublic void end()
         throws IOException
end in class TokenStreamIOExceptionpublic void setEnableChecks(boolean enableChecks)
Copyright © 2000-2017 Apache Software Foundation. All Rights Reserved.