public class SimpleTokenizer extends Pipe
Modifier and Type | Field and Description |
---|---|
protected HashSet<String> |
stoplist |
static int |
USE_DEFAULT_ENGLISH_STOPLIST |
static int |
USE_EMPTY_STOPLIST |
Constructor and Description |
---|
SimpleTokenizer(File stopfile) |
SimpleTokenizer(HashSet<String> stoplist) |
SimpleTokenizer(int languageFlag) |
Modifier and Type | Method and Description |
---|---|
SimpleTokenizer |
deepClone() |
Instance |
pipe(Instance instance)
Really this should be 'protected', but isn't for historical reasons.
|
void |
stop(String word) |
alphabetsMatch, getAlphabet, getAlphabets, getDataAlphabet, getInstanceId, getTargetAlphabet, instanceFrom, instancesFrom, instancesFrom, isDataAlphabetSet, isTargetProcessing, newIteratorFrom, preceedingPipeDataAlphabetNotification, preceedingPipeTargetAlphabetNotification, precondition, readResolve, setDataAlphabet, setOrCheckDataAlphabet, setOrCheckTargetAlphabet, setTargetAlphabet, setTargetProcessing
public static final int USE_EMPTY_STOPLIST
public static final int USE_DEFAULT_ENGLISH_STOPLIST
public SimpleTokenizer(int languageFlag)
public SimpleTokenizer(File stopfile)
Copyright © 2016. All rights reserved.