Chains a pipeline onto the end of this one, producing a new pipeline.
Chains a pipeline onto the end of this one, producing a new pipeline.
the pipeline to chain
Apply this Transformer to a single input item
Apply this Transformer to a single input item
The input item to pass into this transformer
The output value
Apply this Transformer to an RDD of input items
Apply this Transformer to an RDD of input items
The bulk RDD input to pass into this transformer
The bulk RDD output for the given input
The size of the n-grams to output
A graphviz dot representation of this pipeline
Transformer that uses CoreNLP to (in order): - Tokenize document - Lemmatize tokens - Replace entities w/ their type (e.g. "Jon" => "NAME", "Paris" => "PLACE") - Return n-grams for the above (respecting sentence boundaries) Note: Much slower than just using Tokenizer followed by NGramsFeaturizer
The size of the n-grams to output