public interface Tokenizer
Tokenizer interface provides the ability to break-down sentences into embeddable tokens.| Modifier and Type | Method and Description |
|---|---|
java.lang.String |
buildSentence(java.util.List<java.lang.String> tokens)
Combines a list of tokens to form a sentence.
|
java.util.List<java.lang.String> |
tokenize(java.lang.String sentence)
Breaks down the given sentence into a list of tokens that can be represented by embeddings.
|
java.util.List<java.lang.String> tokenize(java.lang.String sentence)
sentence - the sentence to tokenizeList of tokensjava.lang.String buildSentence(java.util.List<java.lang.String> tokens)
tokens - the List of tokens