Name of the DOCUMENT Annotator type column
Name of the Sentences of DOCUMENT Annotator type column
Name of the TOKEN Annotator type column
Name of the POS Annotator type column
Index of the column for NER Label in the dataset
Index of the column for the POS tags in the dataset
Index of the column for the text in the dataset
Name of the NAMED_ENTITY Annotator type column
Whether to explode each sentence to a separate row
Delimiter used to separate columns inside CoNLL file
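The column indexes, delimiter, and sentence-explode option described above can be illustrated with a minimal pure-Python sketch. It does not use the library: `read_sentences`, `to_rows`, and their keyword names are hypothetical, and the default values are assumptions that follow the usual CoNLL 2003 column order (word, POS tag, chunk tag, NER label).

```python
# Illustrative sample in CoNLL 2003 layout: word, POS tag, chunk tag, NER label.
SAMPLE = """-DOCSTART- -X- -X- O

EU NNP B-NP B-ORG
rejects VBZ B-VP O
German JJ B-NP B-MISC
call NN I-NP O

Peter NNP B-NP B-PER
Blackburn NNP I-NP I-PER
"""


def read_sentences(text, text_col=0, pos_index=1, label_index=3, delimiter=" "):
    """Split CoNLL-style text into sentences of {text, pos, label} tokens.

    The defaults are assumptions based on the CoNLL 2003 column order.
    """
    sentences, current = [], []
    for line in text.splitlines():
        # Blank lines end a sentence; -DOCSTART- lines mark a document boundary.
        if not line.strip() or line.startswith("-DOCSTART-"):
            if current:
                sentences.append(current)
                current = []
            continue
        cols = line.split(delimiter)
        current.append(
            {
                "text": cols[text_col],
                "pos": cols[pos_index],
                "label": cols[label_index],
            }
        )
    if current:
        sentences.append(current)
    return sentences


def to_rows(sentences, explode_sentences=True):
    """Return one row per sentence, or a single row holding every token.

    Assumption: the whole input is treated as one document when not exploding.
    """
    if explode_sentences:
        return sentences
    return [[token for sentence in sentences for token in sentence]]
```

For a tab-separated file, the same sketch would be called with `delimiter="\t"`; a dataset with a different column order would only need different index arguments.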
Helper class to load a CoNLL type dataset for training.
The dataset should be in the format of CoNLL 2003 and needs to be specified with readDataset. Other CoNLL datasets are not supported.
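Because only the CoNLL 2003 layout is supported, it can help to check a file's shape before loading it. The following is a rough heuristic, not part of the library: `looks_like_conll2003` is a hypothetical helper that assumes every non-blank line, including `-DOCSTART-` lines, carries exactly four delimiter-separated columns.

```python
def looks_like_conll2003(text, delimiter=" "):
    """Heuristic check: every non-blank line has exactly four columns.

    This only inspects the column count; it does not validate tag values.
    """
    lines = [line for line in text.splitlines() if line.strip()]
    return bool(lines) and all(len(line.split(delimiter)) == 4 for line in lines)
```

A CoNLL-U file, for example, has ten tab-separated columns per token line and would fail this check.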