Calculate precision, recall, and f1 for each label base on scores of form (gold, predicted)
Calculate precision, recall, and f1 for each label base on scores of form (gold, predicted)
Map from label to Performance
Creates dataset folds to be used for cross validation
Implements stratified cross validation; producing pairs of gold/predicted labels across the training dataset.
Implements stratified cross validation; producing pairs of gold/predicted labels across the training dataset. Each fold is as balanced as possible by label L.