edu.arizona.sista.processors.fastnlp

FastNLPProcessor

Related Docs: object FastNLPProcessor | package fastnlp

class FastNLPProcessor extends ShallowNLPProcessor

Fast NLP tools Extends ShallowNLP with a dependency parser based on maltparser or the Stanford NN dependency parser This means that constituent trees and coreference, which depends on that, are not available The default setting is to use the Stanford parser with "basic" dependencies Malt produces ONLY Stanford "basic" dependencies, rather than "collapsed" ones User: mihais Date: 1/4/14

Linear Supertypes
Ordering
  1. Alphabetic
  2. By inheritance
Inherited
  1. FastNLPProcessor
  2. ShallowNLPProcessor
  3. Processor
  4. AnyRef
  5. Any
  1. Hide All
  2. Show all
Learn more about member selection
Visibility
  1. Public
  2. All

Instance Constructors

  1. new FastNLPProcessor(internStrings: Boolean = true, useMalt: Boolean = false, withDiscourse: Boolean = false)

Value Members

  1. final def !=(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  4. def annotate(doc: Document): Document

    Definition Classes
    Processor
  5. def annotate(text: String, keepText: Boolean = false): Document

    Definition Classes
    Processor
  6. def annotateFromSentences(sentences: Iterable[String], keepText: Boolean = false): Document

    Definition Classes
    Processor
  7. def annotateFromTokens(sentences: Iterable[Iterable[String]], keepText: Boolean = false): Document

    Definition Classes
    Processor
  8. def arrayOrNone[T](b: ArrayBuffer[T])(implicit arg0: ClassTag[T]): Option[Array[T]]

    Definition Classes
    ShallowNLPProcessor
  9. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  10. def basicSanityCheck(doc: Document, checkAnnotation: Boolean = true): Option[Annotation]

    Definition Classes
    ShallowNLPProcessor
  11. lazy val chunker: CRFChunker

    Definition Classes
    ShallowNLPProcessor
  12. def chunking(doc: Document): Unit

    Shallow parsing; modifies the document in place

    Shallow parsing; modifies the document in place

    Definition Classes
    ShallowNLPProcessorProcessor
  13. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  14. def discourse(doc: Document): Unit

    Discourse parsing; modifies the document in place

    Discourse parsing; modifies the document in place

    Definition Classes
    FastNLPProcessorShallowNLPProcessorProcessor
  15. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  16. def equals(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  17. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  18. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  19. def hashCode(): Int

    Definition Classes
    AnyRef → Any
  20. def in(s: String): String

    Definition Classes
    ShallowNLPProcessor
  21. val internStrings: Boolean

    Definition Classes
    ShallowNLPProcessor
  22. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  23. def labelSemanticRoles(doc: Document): Unit

    SRL; modifies the document in place

    SRL; modifies the document in place

    Definition Classes
    ShallowNLPProcessorProcessor
  24. def lemmatize(doc: Document): Unit

    Lematization; modifies the document in place

    Lematization; modifies the document in place

    Definition Classes
    ShallowNLPProcessorProcessor
  25. lazy val lemmatizer: StanfordCoreNLP

    Definition Classes
    ShallowNLPProcessor
  26. lazy val maltService: ThreadLocal[MaltParserService]

    One maltparser instance for each thread MUST have one separate malt instance per thread! malt uses a working directory which is written at runtime using ThreadLocal variables guarantees that each thread gets its own working directory

  27. def mkChunker: CRFChunker

    Definition Classes
    ShallowNLPProcessor
  28. def mkDocument(text: String, keepText: Boolean): Document

    Constructs a document of tokens from free text; includes sentence splitting and tokenization

    Constructs a document of tokens from free text; includes sentence splitting and tokenization

    Definition Classes
    ShallowNLPProcessorProcessor
  29. def mkDocumentFromSentences(sentences: Iterable[String], keepText: Boolean, charactersBetweenSentences: Int = 1): Document

    Constructs a document of tokens from an array of untokenized sentences

    Constructs a document of tokens from an array of untokenized sentences

    Definition Classes
    ShallowNLPProcessorProcessor
  30. def mkDocumentFromTokens(sentences: Iterable[Iterable[String]], keepText: Boolean, charactersBetweenSentences: Int = 1, charactersBetweenTokens: Int = 1): Document

    Constructs a document of tokens from an array of tokenized sentences

    Constructs a document of tokens from an array of tokenized sentences

    Definition Classes
    ShallowNLPProcessorProcessor
  31. def mkLemmatizer: StanfordCoreNLP

    Definition Classes
    ShallowNLPProcessor
  32. def mkNer: StanfordCoreNLP

    Definition Classes
    ShallowNLPProcessor
  33. def mkPosTagger: StanfordCoreNLP

    Definition Classes
    ShallowNLPProcessor
  34. def mkSentence(annotation: CoreMap): Sentence

    Definition Classes
    ShallowNLPProcessor
  35. def mkTokenizerWithSentenceSplitting: StanfordCoreNLP

    Definition Classes
    ShallowNLPProcessor
  36. def mkTokenizerWithoutSentenceSplitting: StanfordCoreNLP

    Definition Classes
    ShallowNLPProcessor
  37. def namedEntitySanityCheck(doc: Document): Option[Annotation]

    Definition Classes
    ShallowNLPProcessor
  38. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  39. lazy val ner: StanfordCoreNLP

    Definition Classes
    ShallowNLPProcessor
  40. final def notify(): Unit

    Definition Classes
    AnyRef
  41. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  42. def parse(doc: Document): Unit

    Syntactic parsing; modifies the document in place

    Syntactic parsing; modifies the document in place

    Definition Classes
    FastNLPProcessorShallowNLPProcessorProcessor
  43. lazy val posTagger: StanfordCoreNLP

    Definition Classes
    ShallowNLPProcessor
  44. def postprocessTags(annotation: Annotation): Unit

    Hook to allow postprocessing of CoreNLP POS tagging *in place*, overwriting original POS tags This is useful for domain-specific corrections

    Hook to allow postprocessing of CoreNLP POS tagging *in place*, overwriting original POS tags This is useful for domain-specific corrections

    annotation

    The CoreNLP annotation

    Definition Classes
    ShallowNLPProcessor
  45. def postprocessTokens(sentence: CoreMap): List[CoreLabel]

    Attributes
    protected
    Definition Classes
    ShallowNLPProcessor
  46. def postprocessTokens(originalTokens: Array[CoreLabel]): Array[CoreLabel]

    Hook to allow postprocessing of CoreNLP tokenization This is useful for domain-specific corrections, such as the ones in BioNLPProcessor If you change the tokens, make sure to store them back in the sentence!

    Hook to allow postprocessing of CoreNLP tokenization This is useful for domain-specific corrections, such as the ones in BioNLPProcessor If you change the tokens, make sure to store them back in the sentence!

    originalTokens

    Input CoreNLP sentence

    returns

    The modified tokens

    Definition Classes
    ShallowNLPProcessor
  47. def preprocessSentences(origSentences: Iterable[String]): Iterable[String]

    Runs preprocessText on each sentence

    Runs preprocessText on each sentence

    Definition Classes
    Processor
  48. def preprocessText(origText: String): String

    Hook to allow the preprocessing of input text This is useful for domain-specific corrections, such as the ones in BioNLPProcessor, where we remove Table and Fig references Note that this is allowed to change character offsets

    Hook to allow the preprocessing of input text This is useful for domain-specific corrections, such as the ones in BioNLPProcessor, where we remove Table and Fig references Note that this is allowed to change character offsets

    origText

    The original input text

    returns

    The preprocessed text

    Definition Classes
    Processor
  49. def preprocessTokens(origSentences: Iterable[Iterable[String]]): Iterable[Iterable[String]]

    Runs preprocessText on each token

    Runs preprocessText on each token

    Definition Classes
    Processor
  50. def recognizeNamedEntities(doc: Document): Unit

    NER; modifies the document in place

    NER; modifies the document in place

    Definition Classes
    ShallowNLPProcessorProcessor
  51. def resolveCoreference(doc: Document): Unit

    Coreference resolution; modifies the document in place

    Coreference resolution; modifies the document in place

    Definition Classes
    ShallowNLPProcessorProcessor
  52. lazy val rstDependencyParser: RSTParser

    RST discourse parser using only dependency based syntax

  53. lazy val stanfordDepParser: DependencyParser

    Stanford's NN dependency parser

  54. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  55. def tagPartsOfSpeech(doc: Document): Unit

    Part of speech tagging This modifies the document in place, which is not too elegant.

    Part of speech tagging This modifies the document in place, which is not too elegant. But there are two reasons for this: (a) Some annotators (e.g., Stanford's CoreNLP) require some state (i.e., their Annotation object) to be passed between operations; (b) This is more efficient during annotate() where all the possible operations are chained.

    Definition Classes
    ShallowNLPProcessorProcessor
  56. def toString(): String

    Definition Classes
    AnyRef → Any
  57. lazy val tokenizerWithSentenceSplitting: StanfordCoreNLP

    Definition Classes
    ShallowNLPProcessor
  58. lazy val tokenizerWithoutSentenceSplitting: StanfordCoreNLP

    Definition Classes
    ShallowNLPProcessor
  59. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  60. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  61. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )

Inherited from ShallowNLPProcessor

Inherited from Processor

Inherited from AnyRef

Inherited from Any

Ungrouped