Class/Object

org.clulab.sequences

CombinedLexiconNER

Related Docs: object CombinedLexiconNER | package sequences

Permalink

class CombinedLexiconNER extends LexiconNER

Lexicon-based NER which efficiently recognizes entities from large dictionaries by combining like matchers

Case insensitive matching is performed by one matcher and case sensitive by the other. Each can account for multiple KBs. Each IntHashTrie stores Ints which indicate which of the KBs an entry comes from. The KBs, either from the kbs or overrideKBs in LexiconNER.apply, have priorities, and the one with highest priority is recorded.

Annotations
@SerialVersionUID()
Linear Supertypes
LexiconNER, Serializable, Serializable, Tagger[String], AnyRef, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. CombinedLexiconNER
  2. LexiconNER
  3. Serializable
  4. Serializable
  5. Tagger
  6. AnyRef
  7. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Instance Constructors

  1. new CombinedLexiconNER(caseInsensitiveMatcher: IntHashTrie, caseSensitiveMatcher: IntHashTrie, labels: Seq[String], knownCaseInsensitives: Set[String], useLemmas: Boolean, entityValidator: EntityValidator)

    Permalink

    caseInsensitiveMatcher

    A map of tries to be matched for for case insensitive KBs

    caseSensitiveMatcher

    A map of tries to be matched for for case sensitive KBs

    labels

    Labels matching all of the kbs and overrideKBs used in the matchers. They should be in the order that the kbs were specified and continue in the order that any additional labels are encountered in overrideKBs.

    knownCaseInsensitives

    Set of single-token entity names that can be spelled using lower case, according to the KB(s)

    useLemmas

    If true, tokens are matched using lemmas, otherwise using words

    entityValidator

    An object able to validate any matches that are found

Value Members

  1. final def !=(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  4. final def asInstanceOf[T0]: T0

    Permalink
    Definition Classes
    Any
  5. val bLabels: Seq[String]

    Permalink
    Attributes
    protected
  6. val caseInsensitiveMatcher: IntHashTrie

    Permalink

    A map of tries to be matched for for case insensitive KBs

  7. val caseSensitiveMatcher: IntHashTrie

    Permalink

    A map of tries to be matched for for case sensitive KBs

  8. def clone(): AnyRef

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  9. val contentQualifiers: Array[(IndexedSeqView[String, Array[String]]) ⇒ Boolean]

    Permalink
    Definition Classes
    LexiconNER
  10. def contentfulSpan(sentence: Sentence, start: Int, length: Int): Boolean

    Permalink
    Attributes
    protected
    Definition Classes
    LexiconNER
  11. def countCharacters(wordsView: IndexedSeqView[String, Array[String]]): Int

    Permalink
    Definition Classes
    LexiconNER
  12. val entityValidator: EntityValidator

    Permalink

    An object able to validate any matches that are found

  13. final def eq(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  14. def equals(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  15. def equalsForSerialization(other: AnyRef): Boolean

    Permalink

    The class is serializable and this method is used during testing to determine whether a reconstitued object is equal to the original without interfering with the operation of equals and getting into hash codes.

    The class is serializable and this method is used during testing to determine whether a reconstitued object is equal to the original without interfering with the operation of equals and getting into hash codes. Is is not necessary for this operation to be efficient or complete.

    other

    The object to compare to

    returns

    Whether this and other are equal, at least as far is serialization is concerned

    Definition Classes
    CombinedLexiconNERLexiconNER
  16. def finalize(): Unit

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  17. def find(sentence: Sentence): Array[String]

    Permalink

    Matches the lexicons against this sentence

    Matches the lexicons against this sentence

    sentence

    The input sentence

    returns

    An array of BIO notations the store the outcome of the matches

    Definition Classes
    CombinedLexiconNERLexiconNERTagger
  18. def findLongestMatch(sentence: Sentence, caseSensitiveTokens: Array[String], caseInsensitiveTokens: Array[String]): Array[String]

    Permalink

    Finds the longest match across all matchers.

    Finds the longest match across all matchers. This means that the longest match is always chosen, even if coming from a matcher with lower priority Only ties are disambiguated according to the order provided in the constructor

    Attributes
    protected
  19. final def getClass(): Class[_]

    Permalink
    Definition Classes
    AnyRef → Any
  20. def getLabels: Seq[String]

    Permalink
    Definition Classes
    CombinedLexiconNERLexiconNER
  21. def getLemmas(sentence: Sentence): Array[String]

    Permalink
    Attributes
    protected
    Definition Classes
    LexiconNER
  22. val getTokens: (Sentence) ⇒ Array[String]

    Permalink
    Attributes
    protected
    Definition Classes
    LexiconNER
  23. def getWords(sentence: Sentence): Array[String]

    Permalink
    Attributes
    protected
    Definition Classes
    LexiconNER
  24. val hasCaseInsensitive: Boolean

    Permalink
    Attributes
    protected
  25. val hasCaseSensitive: Boolean

    Permalink
    Attributes
    protected
  26. def hasCondition(wordsView: IndexedSeqView[String, Array[String]], condition: (Char) ⇒ Boolean): Boolean

    Permalink
    Definition Classes
    LexiconNER
  27. def hasDigit(wordsView: IndexedSeqView[String, Array[String]]): Boolean

    Permalink
    Definition Classes
    LexiconNER
  28. def hasLetter(wordsView: IndexedSeqView[String, Array[String]]): Boolean

    Permalink
    Definition Classes
    LexiconNER
  29. def hasSpace(wordsView: IndexedSeqView[String, Array[String]]): Boolean

    Permalink
    Definition Classes
    LexiconNER
  30. def hasUpperCaseLetters(wordsView: IndexedSeqView[String, Array[String]]): Boolean

    Permalink
    Definition Classes
    LexiconNER
  31. def hashCode(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  32. val iLabels: Seq[String]

    Permalink
    Attributes
    protected
  33. final def isInstanceOf[T0]: Boolean

    Permalink
    Definition Classes
    Any
  34. val knownCaseInsensitives: Set[String]

    Permalink

    Words known to appear with and without capitalized letters which help determine whether a span of text is contentful

    Words known to appear with and without capitalized letters which help determine whether a span of text is contentful

    Definition Classes
    LexiconNER
  35. val labels: Seq[String]

    Permalink

    Labels matching all of the kbs and overrideKBs used in the matchers.

    Labels matching all of the kbs and overrideKBs used in the matchers. They should be in the order that the kbs were specified and continue in the order that any additional labels are encountered in overrideKBs.

  36. final def ne(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  37. final def notify(): Unit

    Permalink
    Definition Classes
    AnyRef
  38. final def notifyAll(): Unit

    Permalink
    Definition Classes
    AnyRef
  39. final def synchronized[T0](arg0: ⇒ T0): T0

    Permalink
    Definition Classes
    AnyRef
  40. def toString(stringBuilder: StringBuilder): Unit

    Permalink
  41. def toString(): String

    Permalink
    Definition Classes
    AnyRef → Any
  42. val useLemmas: Boolean

    Permalink

    If false, use the words of a sentence; if true, the lemmas

    If false, use the words of a sentence; if true, the lemmas

    Definition Classes
    LexiconNER
  43. final def wait(): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  44. final def wait(arg0: Long, arg1: Int): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  45. final def wait(arg0: Long): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )

Inherited from LexiconNER

Inherited from Serializable

Inherited from Serializable

Inherited from Tagger[String]

Inherited from AnyRef

Inherited from Any

Ungrouped