Given a list of strings and a minimum threshold qualifyingCount, returns the set of
strings that appear at least qualifyingCount times in the argument list.
the strings that we want to count
the minimum frequency of words to include in the return value
the set of strings that appear at least qualifyingCount times in words
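
As an illustration of the behavior described above, here is a minimal Python sketch. It is not the library's own implementation: the function name strings_with_min_count and the snake_case parameter names are hypothetical, chosen only for this example.

```python
from collections import Counter


def strings_with_min_count(words, qualifying_count):
    """Return the set of strings occurring at least qualifying_count times.

    Hypothetical helper for illustration; not part of the documented library.
    """
    counts = Counter(words)  # string -> number of occurrences in words
    return {word for word, count in counts.items() if count >= qualifying_count}


sample = ["the", "cat", "the", "dog", "cat", "the"]
frequent = strings_with_min_count(sample, 2)  # "dog" occurs only once, so it is excluded
```

Returning a set rather than a list matches the documented return value: each qualifying string is reported once, and no ordering is implied.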
Maps standard Penn Treebank-style part-of-speech tags into Google's "universal" POS set:
https://code.google.com/p/universal-pos-tags/
TODO: move this to the nlpstack.postag library
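
A rough sketch of what such a mapping can look like, in Python. The table below covers only a subset of Penn Treebank tags and follows the English mapping published by the universal-pos-tags project as I understand it. The names PTB_TO_UNIVERSAL and to_universal, and the choice to fall back to "X" for unlisted tags, are assumptions made for this example and may differ from what the documented code does.

```python
# Partial Penn Treebank -> universal POS table (illustrative subset only).
PTB_TO_UNIVERSAL = {
    "NN": "NOUN", "NNS": "NOUN", "NNP": "NOUN", "NNPS": "NOUN",
    "VB": "VERB", "VBD": "VERB", "VBG": "VERB", "VBN": "VERB",
    "VBP": "VERB", "VBZ": "VERB", "MD": "VERB",
    "JJ": "ADJ", "JJR": "ADJ", "JJS": "ADJ",
    "RB": "ADV", "RBR": "ADV", "RBS": "ADV", "WRB": "ADV",
    "DT": "DET", "PDT": "DET", "WDT": "DET",
    "IN": "ADP",
    "CC": "CONJ",
    "CD": "NUM",
    "PRP": "PRON", "PRP$": "PRON", "WP": "PRON", "WP$": "PRON",
    "TO": "PRT", "RP": "PRT", "POS": "PRT",
    ".": ".", ",": ".", ":": ".",
}


def to_universal(ptb_tag):
    """Map one Penn Treebank tag to a universal tag.

    Assumption for this sketch: tags missing from the table map to "X".
    """
    return PTB_TO_UNIVERSAL.get(ptb_tag, "X")


universal_tags = [to_universal(tag) for tag in ["DT", "NN", "VBZ", "JJ", "."]]
```

A lookup table keeps the mapping easy to audit against the published tag list, at the cost of needing an explicit policy for tags it does not contain.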
Given a list of strings, creates a histogram that maps the strings to their frequency in the list.
the strings that we want to count
a mapping from strings to their frequency in the argument list
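
The histogram described above can be sketched in a few lines of Python. The function name build_histogram is hypothetical and used only for this example; the documented implementation may be structured differently.

```python
def build_histogram(words):
    """Map each distinct string in words to its number of occurrences.

    Hypothetical helper for illustration; not part of the documented library.
    """
    histogram = {}
    for word in words:
        # Start a new string at zero, then count this occurrence.
        histogram[word] = histogram.get(word, 0) + 1
    return histogram


word_counts = build_histogram(["a", "b", "a", "c", "a"])
```

Strings that never appear in the input have no entry at all, so callers should look counts up with a default of zero rather than indexing directly.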