Chains a pipeline onto the end of this one, producing a new pipeline.
the pipeline to chain
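As a rough illustration of what chaining means here, the sketch below models a pipeline stage as a thin wrapper around a function. `Stage`, `andThen`, and `ChainSketch` are hypothetical names chosen for this example and are not this library's types; the point is only that chaining yields a new stage and leaves both originals unchanged.

```scala
// Hypothetical stand-in for a pipeline stage; not the library's actual type.
final case class Stage[A, B](run: A => B) {
  // Chain `next` onto the end of this stage, producing a new stage.
  def andThen[C](next: Stage[B, C]): Stage[A, C] =
    Stage[A, C]((a: A) => next.run(run(a)))
}

object ChainSketch {
  // Split a string on whitespace.
  val tokenize: Stage[String, Seq[String]] =
    Stage[String, Seq[String]]((s: String) => s.split("\\s+").toSeq)

  // Count the tokens.
  val count: Stage[Seq[String], Int] =
    Stage[Seq[String], Int]((ts: Seq[String]) => ts.length)

  // A new stage built by chaining; `tokenize` and `count` are untouched.
  val tokenCount: Stage[String, Int] = tokenize.andThen(count)
}
```

Calling `ChainSketch.tokenCount.run(...)` on a string applies `tokenize` first and then `count` to its result.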
Apply this Transformer to a single input item
The output value
Apply this Transformer to an RDD of input items
The bulk RDD input to pass into this transformer
The bulk RDD output for the given input
A graphviz dot representation of this pipeline
the counts of unigrams in the training corpus
Encodes string tokens as non-negative integers, where each integer is the token's position in the vocabulary sorted by frequency. Out-of-vocabulary tokens are mapped to the special index -1.
The parameters passed to this class are usually calculated by WordFrequencyEncoder.
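The encoding described above can be sketched without Spark using plain Scala collections. This is an illustrative approximation under stated assumptions, not the library's implementation: `WordFrequencySketch`, `fit`, and `encode` are hypothetical names, index 0 is assumed to be the most frequent token, and ties are assumed to be broken alphabetically.

```scala
object WordFrequencySketch {
  // Hypothetical helper: build a token -> index map from a tokenized corpus.
  // Assumption: index 0 is the most frequent token; ties break alphabetically.
  def fit(corpus: Seq[Seq[String]]): Map[String, Int] = {
    // Count occurrences of each unigram in the training corpus.
    val counts: Map[String, Int] =
      corpus.flatten.groupBy(identity).map { case (token, occ) => (token, occ.size) }

    counts.toSeq
      .sortBy { case (token, n) => (-n, token) } // descending count, then token
      .map(_._1)
      .zipWithIndex
      .toMap
  }

  // Encode tokens as indices; out-of-vocabulary tokens map to -1.
  def encode(vocab: Map[String, Int])(tokens: Seq[String]): Seq[Int] =
    tokens.map(t => vocab.getOrElse(t, -1))
}
```

In this sketch `fit` plays the role of the step that computes the parameters, and `encode` plays the role of the resulting encoder applied to new token sequences.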