Class EnglishTextTokenizer

    • Constructor Detail

      • EnglishTextTokenizer

        public EnglishTextTokenizer()
    • Method Detail

      • getLanguage

        public Languages getLanguage()
        Description copied from interface: TextTokenizer
        Gets the language for the tokenizer.
        Returns:
        the language for this tokenizer.
      • stopWords

        public Set<String> stopWords()
        Description copied from interface: TextTokenizer
        Gets all stop-words for a language.
        Returns:
        the set of all stop-words.