Matches standard date formats into a provided format
Class to find standarized lemmas from words.
Matches standard date formats into a provided format
A feature transformer that converts the input array of strings (annotatorType TOKEN) into an array of n-grams (annotatorType CHUNK).
A feature transformer that converts the input array of strings (annotatorType TOKEN) into an array of n-grams (annotatorType CHUNK). Null values in the input array are ignored. It returns an array of n-grams where each n-gram is represented by a space-separated string of words.
When the input is empty, an empty array is returned. When the input array length is less than n (number of elements per n-gram), no n-grams are returned.
Annotator that cleans out tokens.
Annotator that cleans out tokens. Requires stems, hence tokens
Matches regular expressions and maps them to specified values optionally provided Rules are provided from external source file
Hard stemming of words for cut-of into standard word references
Extracts entities out of provided phrases
Tokenizes raw text into word pieces, tokens.
Class to find standarized lemmas from words. Uses a user-provided or default dictionary.