Package

com.johnsnowlabs.nlp

annotators

Permalink

package annotators

Visibility
  1. Public
  2. All

Type Members

  1. class ChunkTokenizer extends Tokenizer

    Permalink
  2. class ChunkTokenizerModel extends TokenizerModel

    Permalink
  3. class Chunker extends AnnotatorModel[Chunker]

    Permalink
  4. class DateMatcher extends AnnotatorModel[DateMatcher] with DateMatcherUtils

    Permalink

    Matches standard date formats into a provided format

  5. trait DateMatcherUtils extends Params

    Permalink
  6. class Lemmatizer extends AnnotatorApproach[LemmatizerModel]

    Permalink

    Class to find standarized lemmas from words.

    Class to find standarized lemmas from words. Uses a user-provided or default dictionary.

  7. class LemmatizerModel extends AnnotatorModel[LemmatizerModel]

    Permalink
  8. class MultiDateMatcher extends AnnotatorModel[MultiDateMatcher] with DateMatcherUtils

    Permalink

    Matches standard date formats into a provided format

  9. class NGramGenerator extends AnnotatorModel[NGramGenerator]

    Permalink

    A feature transformer that converts the input array of strings (annotatorType TOKEN) into an array of n-grams (annotatorType CHUNK).

    A feature transformer that converts the input array of strings (annotatorType TOKEN) into an array of n-grams (annotatorType CHUNK). Null values in the input array are ignored. It returns an array of n-grams where each n-gram is represented by a space-separated string of words.

    When the input is empty, an empty array is returned. When the input array length is less than n (number of elements per n-gram), no n-grams are returned.

  10. class Normalizer extends AnnotatorApproach[NormalizerModel]

    Permalink

    Annotator that cleans out tokens.

    Annotator that cleans out tokens. Requires stems, hence tokens

  11. class NormalizerModel extends AnnotatorModel[NormalizerModel]

    Permalink
  12. trait ReadablePretrainedLemmatizer extends ParamsAndFeaturesReadable[LemmatizerModel] with HasPretrained[LemmatizerModel]

    Permalink
  13. trait ReadablePretrainedTextMatcher extends ParamsAndFeaturesReadable[TextMatcherModel] with HasPretrained[TextMatcherModel]

    Permalink
  14. trait ReadablePretrainedTokenizer extends ParamsAndFeaturesReadable[TokenizerModel] with HasPretrained[TokenizerModel]

    Permalink
  15. class RegexMatcher extends AnnotatorApproach[RegexMatcherModel]

    Permalink
  16. class RegexMatcherModel extends AnnotatorModel[RegexMatcherModel]

    Permalink

    Matches regular expressions and maps them to specified values optionally provided Rules are provided from external source file

  17. class SimpleTokenizer extends AnnotatorModel[SimpleTokenizer]

    Permalink
  18. class Stemmer extends AnnotatorModel[Stemmer]

    Permalink

    Hard stemming of words for cut-of into standard word references

  19. class StopWordsCleaner extends AnnotatorModel[StopWordsCleaner]

    Permalink
  20. class TextMatcher extends AnnotatorApproach[TextMatcherModel] with ParamsAndFeaturesWritable

    Permalink
  21. class TextMatcherModel extends AnnotatorModel[TextMatcherModel]

    Permalink

    Extracts entities out of provided phrases

  22. class Token2Chunk extends AnnotatorModel[Token2Chunk]

    Permalink
  23. class Tokenizer extends AnnotatorApproach[TokenizerModel]

    Permalink
  24. class TokenizerModel extends AnnotatorModel[TokenizerModel]

    Permalink

    Tokenizes raw text into word pieces, tokens.

Value Members

  1. object ChunkTokenizer extends DefaultParamsReadable[ChunkTokenizer] with Serializable

    Permalink
  2. object ChunkTokenizerModel extends ParamsAndFeaturesReadable[ChunkTokenizerModel] with Serializable

    Permalink
  3. object Chunker extends DefaultParamsReadable[Chunker] with Serializable

    Permalink
  4. object DateMatcher extends DefaultParamsReadable[DateMatcher] with Serializable

    Permalink
  5. object EnglishStemmer

    Permalink
  6. object Lemmatizer extends DefaultParamsReadable[Lemmatizer] with Serializable

    Permalink
  7. object LemmatizerModel extends ReadablePretrainedLemmatizer with Serializable

    Permalink
  8. object MultiDateMatcher extends DefaultParamsReadable[MultiDateMatcher] with Serializable

    Permalink
  9. object NGramGenerator extends ParamsAndFeaturesReadable[NGramGenerator] with Serializable

    Permalink
  10. object Normalizer extends DefaultParamsReadable[Normalizer] with Serializable

    Permalink
  11. object NormalizerModel extends ParamsAndFeaturesReadable[NormalizerModel] with Serializable

    Permalink
  12. object RegexMatcher extends DefaultParamsReadable[RegexMatcher] with Serializable

    Permalink
  13. object RegexMatcherModel extends ParamsAndFeaturesReadable[RegexMatcherModel] with Serializable

    Permalink
  14. object Stemmer extends DefaultParamsReadable[Stemmer] with Serializable

    Permalink
  15. object StopWordsCleaner extends ParamsAndFeaturesReadable[StopWordsCleaner] with Serializable

    Permalink
  16. object TextMatcher extends DefaultParamsReadable[TextMatcher] with Serializable

    Permalink
  17. object TextMatcherModel extends ReadablePretrainedTextMatcher with Serializable

    Permalink
  18. object Token2Chunk extends DefaultParamsReadable[Token2Chunk] with Serializable

    Permalink
  19. object Tokenizer extends DefaultParamsReadable[Tokenizer] with Serializable

    Permalink
  20. object TokenizerModel extends ReadablePretrainedTokenizer with Serializable

    Permalink
  21. package btm

    Permalink
  22. package common

    Permalink
  23. package ner

    Permalink
  24. package param

    Permalink
  25. package parser

    Permalink
  26. package pos

    Permalink
  27. package sbd

    Permalink
  28. package sda

    Permalink
  29. package spell

    Permalink

Ungrouped