org.clulab.sequences
Contains all bigrams in training that occur more BIGRAM_THRESHOLD times
Counts the bigrams seen in this corpus so we filter out the non-frequent ones
The training corpus