A map of tries to be matched for each given category label The order of the matchers is important: it indicates priority during ties (first has higher priority)
Set of single-token entity names that can be spelled using lower case, according to the KB(s)
If true, tokens are matched using lemmas, otherwise using words Author: mihais Created: 5/11/15 Modified: 9/27/17 - Clean up from RuleNER into LexiconNER
Matches the lexicons against this sentence
Matches the lexicons against this sentence
The input sentence
An array of BIO notations the store the outcome of the matches
Finds the longest match across all matchers.
Finds the longest match across all matchers. This means that the longest match is always chosen, even if coming from a matcher with lower priority Only ties are disambiguated according to the order provided in the constructor
A map of tries to be matched for each given category label The order of the matchers is important: it indicates priority during ties (first has higher priority)
Lexicon-based NER, which efficiently recognizes entities from large dictionaries Note: This is a cleaned-up version of the old RuleNER. Create a LexiconNER object using LexiconNER.apply() (not the c'tor, which is private). Use it by calling the find() method on a single sentence. See org.clulab.processors.TextLexiconNER for usage examples.