Package opennlp.tools.tokenize
Class DictionaryDetokenizer
java.lang.Object
opennlp.tools.tokenize.DictionaryDetokenizer
- All Implemented Interfaces:
Detokenizer
A rule based detokenizer. Simple rules which indicate in which direction a token should be
moved are looked up in a
DetokenizationDictionary
object.- See Also:
-
Nested Class Summary
Nested classes/interfaces inherited from interface opennlp.tools.tokenize.Detokenizer
Detokenizer.DetokenizationOperation
-
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptiondetokenize
(String[] tokens) Detokenize the input tokens.detokenize
(String[] tokens, String splitMarker) Detokenize the input tokens into a String.
-
Constructor Details
-
DictionaryDetokenizer
-
-
Method Details
-
detokenize
Description copied from interface:Detokenizer
Detokenize the input tokens.- Specified by:
detokenize
in interfaceDetokenizer
- Parameters:
tokens
- the tokens to detokenize.- Returns:
- the merge operations to detokenize the input tokens.
-
detokenize
Description copied from interface:Detokenizer
Detokenize the input tokens into a String. Tokens which are connected without a space inbetween can be separated by a split marker.- Specified by:
detokenize
in interfaceDetokenizer
- Parameters:
tokens
- the token which should be concatenatedsplitMarker
- the split marker or null- Returns:
- the concatenated tokens
-