LexiconNER

Value Members

final def !=(arg0: Any): Boolean

Definition Classes
AnyRef → Any
final def ##(): Int

Definition Classes
AnyRef → Any
final def ==(arg0: Any): Boolean

Definition Classes
AnyRef → Any
val KNOWN_CASE_INSENSITIVE_LENGTH: Int
val OUTSIDE_LABEL: String
val OVERRIDE_ENTITY_VALIDATOR: Boolean
var USE_COMPACT: Boolean

If the FastLexiconNERBuilder is beind used, indicates when true that a CompactLexiconNER should be created and otherwise a CombinedLexiconNER
val USE_DEBUG: Boolean
var USE_FAST: Boolean

When true indicates use of the FastLexiconNERBuilder and otherwise the SlowLexiconNERBuilder to construct the LexiconNER
def apply(kbs: Seq[String], entityValidator: EntityValidator = new TrueEntityValidator, useLemmasForMatching: Boolean = false, caseInsensitiveMatching: Boolean = false): LexiconNER

Creates a LexiconNER from a list of KBs Note that file name (minus the extension) for each KB becomes the name of the corresponding category.
Creates a LexiconNER from a list of KBs Note that file name (minus the extension) for each KB becomes the name of the corresponding category. For example, /Some/Path/SomeCategory.tsv.gz yields the category name SomeCategory. Each of the KBs must contain one entity name per line
kbs
KBs containing known entity names
entityValidator
Filter which decides if a matched entity is valid
useLemmasForMatching
If true, we use Sentence.lemmas instead of Sentence.words during matching
caseInsensitiveMatching
If true, tokens are matched case insensitively
returns
The new LexiconNER
def apply(kbs: Seq[String], overrideKBs: Option[Seq[String]], entityValidator: EntityValidator, lexicalVariationEngine: LexicalVariations, useLemmasForMatching: Boolean, caseInsensitiveMatching: Boolean): LexiconNER

Creates a LexiconNER from a list of KBs Note that file name (minus the extension) for each KB becomes the name of the corresponding category.
Creates a LexiconNER from a list of KBs Note that file name (minus the extension) for each KB becomes the name of the corresponding category. For example, /Some/Path/SomeCategory.tsv.gz yields the category name SomeCategory. Each of the KBs must contain one entity name per line
kbs
KBs containing known entity names
overrideKBs
KBs containing override labels for entity names from kbs (necessary for the bio domain)
entityValidator
Filter which decides if a matched entity is valid
lexicalVariationEngine
Generates alternative spellings of an entity name (necessary for the bio domain)
useLemmasForMatching
If true, we use Sentence.lemmas instead of Sentence.words during matching
caseInsensitiveMatching
If true, tokens are matched case insensitively
returns
The new LexiconNER
def apply(kbs: Seq[String], caseInsensitiveMatchings: Seq[Boolean], baseDirOpt: Option[File]): LexiconNER

This is just like the above but with the addition of the baseDirOpt.
def apply(kbs: Seq[String], caseInsensitiveMatchings: Seq[Boolean]): LexiconNER

Same apply with even more default values filled in
def apply(kbs: Seq[String], caseInsensitiveMatchings: Seq[Boolean], useLemmasForMatching: Boolean): LexiconNER

Same apply with more default values filled in
def apply(kbs: Seq[String], caseInsensitiveMatchings: Seq[Boolean], entityValidator: EntityValidator, useLemmasForMatching: Boolean): LexiconNER

Same apply with some default values filled in
def apply(kbs: Seq[String], overrideKBs: Option[Seq[String]], caseInsensitiveMatchings: Seq[Boolean], entityValidator: EntityValidator, lexicalVariationEngine: LexicalVariations, useLemmasForMatching: Boolean, defaultCaseInsensitive: Boolean, baseDirOpt: Option[File]): LexiconNER

Create a LexiconNER from a pair of sequences of knowledge bases (KBs), the kbs and overrideKBs, with control over the case sensitivity of individual KBs via caseInsensitiveMatchings
Create a LexiconNER from a pair of sequences of knowledge bases (KBs), the kbs and overrideKBs, with control over the case sensitivity of individual KBs via caseInsensitiveMatchings
The matchings run parallel to the KBs. That is, caseInsensitiveMatchings(n) is used for kbs(n). It is possible that contents of an overrideKB refers to a KB that does not exist. In that situation, caseInsensitiveMatching is used as a fallback value.
kbs
KBs containing known entity names
overrideKBs
KBs containing override labels for entity names from kbs (necessary for the bio domain)
caseInsensitiveMatchings
case insensitivities corresponding to the kbs, matched by index
entityValidator
Filter which decides if a matched entity is valid
lexicalVariationEngine
Generates alternative spellings of an entity name (necessary for the bio domain)
useLemmasForMatching
If true, we use Sentence.lemmas instead of Sentence.words during matching
defaultCaseInsensitive
If true, tokens are matched case insensitively
baseDirOpt
An optional directory to force kbs to be loaded from files rather than resources
returns
The new LexiconNER
def apply(standardKbSources: Seq[StandardKbSource], overrideKbSourcesOpt: Option[Seq[OverrideKbSource]], lexicalVariationEngine: LexicalVariations, entityValidator: EntityValidator, useLemmasForMatching: Boolean, defaultCaseInsensitive: Boolean): LexiconNER

Create a LexiconNER from a pair of sequences of knowledge base sources for the kbs and overrideKBs.
Create a LexiconNER from a pair of sequences of knowledge base sources for the kbs and overrideKBs. There are versions of the sources for knowledge bases stored in files and those stored in memory. Each StandardKbSource knows its own caseSensitivityMatching, so no list of those need be supplied. It is possible that contents of an overrideKB refers to a KB (label) that does not exist. In that situation, caseInsensitiveMatching is used as a fallback value. With that, this method should encompass all the functionality of the other apply methods, which now feed into it. Note that some of the arugments are in a different order than the other build methods in order to overload the method despite type erasure.
standardKbSources
KB sources containing known entity names
overrideKbSourcesOpt
KB sources containing override labels for entity names from kbs (necessary for the bio domain)
lexicalVariationEngine
Generates alternative spellings of an entity name (necessary for the bio domain)
entityValidator
Filter which decides if a matched entity is valid
useLemmasForMatching
If true, we use Sentence.lemmas instead of Sentence.words during matching
defaultCaseInsensitive
If true, tokens are matched case insensitively
returns
The new LexiconNER
final def asInstanceOf[T0]: T0

Definition Classes
Any
def clone(): AnyRef

Attributes
protected[java.lang]
Definition Classes
AnyRef
Annotations
@throws( ... )
def countWhile(labels: Array[String], offset: Int, condition: (String) ⇒ Boolean): Int
final def eq(arg0: AnyRef): Boolean

Definition Classes
AnyRef
def equals(arg0: Any): Boolean

Definition Classes
AnyRef → Any
def finalize(): Unit

Attributes
protected[java.lang]
Definition Classes
AnyRef
Annotations
@throws( classOf[java.lang.Throwable] )
final def getClass(): Class[_]

Definition Classes
AnyRef → Any
def hashCode(): Int

Definition Classes
AnyRef → Any
final def isInstanceOf[T0]: Boolean

Definition Classes
Any
def isNotOutside(label: String): Boolean
def isOutside(label: String): Boolean
def mergeLabels(dst: Array[String], src: Array[String]): Unit

Merges labels from src into dst without overlapping any existing labels in dst.
final def ne(arg0: AnyRef): Boolean

Definition Classes
AnyRef
final def notify(): Unit

Definition Classes
AnyRef
final def notifyAll(): Unit

Definition Classes
AnyRef
def scanText(words: Array[String], start: Int, end: Int): (Int, Int, Int, Int, Int)
final def synchronized[T0](arg0: ⇒ T0): T0

Definition Classes
AnyRef
def toString(): String

Definition Classes
AnyRef → Any
final def wait(): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )
final def wait(arg0: Long, arg1: Int): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )
final def wait(arg0: Long): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )

Related Docs: class LexiconNER | package sequences

object LexiconNER extends Serializable

Value Members

final def !=(arg0: Any): Boolean

final def ##(): Int

final def ==(arg0: Any): Boolean

val KNOWN_CASE_INSENSITIVE_LENGTH: Int

val OUTSIDE_LABEL: String

val OVERRIDE_ENTITY_VALIDATOR: Boolean

var USE_COMPACT: Boolean

val USE_DEBUG: Boolean

var USE_FAST: Boolean

def apply(kbs: Seq[String], entityValidator: EntityValidator = new TrueEntityValidator, useLemmasForMatching: Boolean = false, caseInsensitiveMatching: Boolean = false): LexiconNER

def apply(kbs: Seq[String], overrideKBs: Option[Seq[String]], entityValidator: EntityValidator, lexicalVariationEngine: LexicalVariations, useLemmasForMatching: Boolean, caseInsensitiveMatching: Boolean): LexiconNER

def apply(kbs: Seq[String], caseInsensitiveMatchings: Seq[Boolean], baseDirOpt: Option[File]): LexiconNER

def apply(kbs: Seq[String], caseInsensitiveMatchings: Seq[Boolean]): LexiconNER

def apply(kbs: Seq[String], caseInsensitiveMatchings: Seq[Boolean], useLemmasForMatching: Boolean): LexiconNER

def apply(kbs: Seq[String], caseInsensitiveMatchings: Seq[Boolean], entityValidator: EntityValidator, useLemmasForMatching: Boolean): LexiconNER

def apply(kbs: Seq[String], overrideKBs: Option[Seq[String]], caseInsensitiveMatchings: Seq[Boolean], entityValidator: EntityValidator, lexicalVariationEngine: LexicalVariations, useLemmasForMatching: Boolean, defaultCaseInsensitive: Boolean, baseDirOpt: Option[File]): LexiconNER

def apply(standardKbSources: Seq[StandardKbSource], overrideKbSourcesOpt: Option[Seq[OverrideKbSource]], lexicalVariationEngine: LexicalVariations, entityValidator: EntityValidator, useLemmasForMatching: Boolean, defaultCaseInsensitive: Boolean): LexiconNER

final def asInstanceOf[T0]: T0

def clone(): AnyRef

def countWhile(labels: Array[String], offset: Int, condition: (String) ⇒ Boolean): Int

final def eq(arg0: AnyRef): Boolean

def equals(arg0: Any): Boolean

def finalize(): Unit

final def getClass(): Class[_]

def hashCode(): Int

final def isInstanceOf[T0]: Boolean

def isNotOutside(label: String): Boolean

def isOutside(label: String): Boolean

def mergeLabels(dst: Array[String], src: Array[String]): Unit

final def ne(arg0: AnyRef): Boolean

final def notify(): Unit

final def notifyAll(): Unit

def scanText(words: Array[String], start: Int, end: Int): (Int, Int, Int, Int, Int)

final def synchronized[T0](arg0: ⇒ T0): T0

def toString(): String

final def wait(): Unit

final def wait(arg0: Long, arg1: Int): Unit

final def wait(arg0: Long): Unit

Inherited from Serializable

Inherited from Serializable

Inherited from AnyRef

Inherited from Any

Ungrouped