Character separating annotations (Default: @)
Whether to remove annotation columns (Default: true)
Character separating annotations (Default: #)
Annotation metadata format (Default: false)
Name of finisher output cols
Finisher generates an Array with the results instead of a String (Default: true)
Name of input annotation cols
Whether to include embeddings vectors in the process (Default: false)
Required UID for storing the annotator to disk
A list of (hyper-)parameter keys this annotator can take. Users can set and get the parameter values through setters and getters, respectively.
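As a rough illustration of setting and reading parameters, the sketch below configures a Finisher through its setters and reads values back through getters. It assumes Spark NLP is on the classpath (for example in a spark-shell session); the column names "token" and "token_features" are placeholders chosen for this example, not values taken from this page.

```scala
import com.johnsnowlabs.nlp.Finisher

// Configure the finisher through its setters.
// "token" and "token_features" are illustrative column names.
val finisher = new Finisher()
  .setInputCols("token")
  .setOutputCols("token_features")
  .setOutputAsArray(false)      // emit a single String instead of an Array
  .setAnnotationSplitSymbol("@") // separator between annotations
  .setValueSplitSymbol("#")      // separator between values

// Read the configured values back through the getters.
println(finisher.getInputCols.mkString(", "))
println(finisher.getOutputAsArray)
println(finisher.getAnnotationSplitSymbol)

// explainParams() is inherited from Spark ML's Params and lists every
// parameter together with its documentation and current value.
println(finisher.explainParams())
```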
Converts annotation results into a format that is easier to use. It is useful for extracting the results from Spark NLP Pipelines. The Finisher outputs annotation values as String.
For more extended examples on document pre-processing, see the Spark NLP Workshop.
Example
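The original example code did not survive on this page, so the following is a minimal sketch rather than the official one. It assumes a spark-shell style session with Spark NLP on the classpath; the sample sentence, the pipeline stages, and all column names are placeholders chosen for illustration.

```scala
import com.johnsnowlabs.nlp.{DocumentAssembler, Finisher}
import com.johnsnowlabs.nlp.annotators.Tokenizer
import org.apache.spark.ml.Pipeline
import org.apache.spark.sql.SparkSession

// Local session for the sketch; in spark-shell a session already exists.
val spark = SparkSession.builder()
  .appName("finisher-sketch")
  .master("local[*]")
  .getOrCreate()
import spark.implicits._

// Illustrative input data.
val data = Seq("The Finisher turns annotations into plain columns.").toDF("text")

// Turn raw text into document annotations, then into tokens.
val documentAssembler = new DocumentAssembler()
  .setInputCol("text")
  .setOutputCol("document")

val tokenizer = new Tokenizer()
  .setInputCols("document")
  .setOutputCol("token")

// Extract the token results into an ordinary column and drop the
// intermediate annotation columns.
val finisher = new Finisher()
  .setInputCols("token")
  .setOutputCols("token_features")
  .setOutputAsArray(true)
  .setCleanAnnotations(true)

val pipeline = new Pipeline()
  .setStages(Array(documentAssembler, tokenizer, finisher))

val result = pipeline.fit(data).transform(data)
result.select("token_features").show(truncate = false)
```

With setOutputAsArray(false), the same column would instead hold a single String in which the annotations are joined by the annotation split symbol.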
See also: EmbeddingsFinisher for finishing embeddings