com.gilt.nlp

OpenNlpExtractor

class OpenNlpExtractor extends Extractor with Loggable

This class is known to be not threadsafe.

Linear Supertypes
Loggable, Extractor, AnyRef, Any
Ordering
  1. Alphabetic
  2. By inheritance
Inherited
  1. OpenNlpExtractor
  2. Loggable
  3. Extractor
  4. AnyRef
  5. Any
  1. Hide All
  2. Show all
Learn more about member selection
Visibility
  1. Public
  2. All

Instance Constructors

  1. new OpenNlpExtractor(stopWords: Set[String] = scala.this.Predef.Set.empty[String], phraseEndStopWords: Set[String] = scala.this.Predef.Set.empty[String], postfixesToStrip: Set[String] = ..., distanceThreshold: Int = 4)

Value Members

  1. final def !=(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  2. final def !=(arg0: Any): Boolean

    Definition Classes
    Any
  3. final def ##(): Int

    Definition Classes
    AnyRef → Any
  4. final def ==(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  5. final def ==(arg0: Any): Boolean

    Definition Classes
    Any
  6. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  7. def chunkerModel: ChunkerModel

    Attributes
    protected
  8. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  9. def debug(message: ⇒ String, ex: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Loggable
  10. def debug(message: ⇒ String): Unit

    Attributes
    protected
    Definition Classes
    Loggable
  11. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  12. def equals(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  13. def error(message: ⇒ String, ex: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Loggable
  14. def error(message: ⇒ String): Unit

    Attributes
    protected
    Definition Classes
    Loggable
  15. def error(ex: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Loggable
  16. def extractCountedNounPhrases(body: String): Iterable[(String, Int)]

    Extract noun phrases and their counts from a given body of text.

    Extract noun phrases and their counts from a given body of text.

    body
    returns

    Definition Classes
    OpenNlpExtractorExtractor
  17. def extractInterestingPhrases(body: String, allowSkipPhrases: Boolean = true): Iterable[String]

    Inspired by http://dragon.ischool.drexel.edu/xtract.asp

    Inspired by http://dragon.ischool.drexel.edu/xtract.asp

    Using POS tags as described here

    This function implements something akin to the xTract description, by doing the following:

    $ - Sentence detection (to only pull phrases from individual sentences) $ - POS ("Part of Speech") Tagging, to allow processing only certain parts of speech (Nouns, etc.) $ - "Chunking" - to restrict phrase extraction to appropriate sub-sentence structures $ - Filtering chunks to "noun phrases" only $ - Extraction of words based on some simple rules on POS: $ - First word can be either Noun or Adjective $ - Select other Adjectives/Nouns within a threshold of the "first word" $ - From this selection, prepare n-grams such that the last word of N-Gram must be a Noun.

    Definition Classes
    OpenNlpExtractorExtractor
  18. def fatal(message: ⇒ String, ex: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Loggable
  19. def fatal(message: ⇒ String): Unit

    Attributes
    protected
    Definition Classes
    Loggable
  20. def fatal(ex: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Loggable
  21. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  22. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  23. def hashCode(): Int

    Definition Classes
    AnyRef → Any
  24. def info(message: ⇒ String, ex: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Loggable
  25. def info(message: ⇒ String): Unit

    Attributes
    protected
    Definition Classes
    Loggable
  26. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  27. def isStop(word: String): Boolean

    Attributes
    protected
  28. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  29. final def notify(): Unit

    Definition Classes
    AnyRef
  30. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  31. def posModel: POSModel

    Attributes
    protected
  32. def scrub(str: String): String

    str
    returns

    Attributes
    protected
  33. def sentenceModel: SentenceModel

    Attributes
    protected
  34. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  35. def toString(): String

    Definition Classes
    AnyRef → Any
  36. def tokenizerModel: TokenizerModel

    Attributes
    protected
  37. def trace(message: ⇒ String, ex: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Loggable
  38. def trace(message: ⇒ String): Unit

    Attributes
    protected
    Definition Classes
    Loggable
  39. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  40. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  41. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  42. def warn(message: ⇒ String, ex: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Loggable
  43. def warn(message: ⇒ String): Unit

    Attributes
    protected
    Definition Classes
    Loggable

Inherited from Loggable

Inherited from Extractor

Inherited from AnyRef

Inherited from Any

Ungrouped