Fixes common POS tagging mistakes, using the same code used by BioNLPProcessor at runtime
Fixes common POS tagging mistakes, using the same code used by BioNLPProcessor at runtime
List of tokens in one sentence
Reads IOB data directly into Java lists, because the CRF needs the data of this type
Splits a line into k tokens, knowing that the left-most one might contain spaces