CoreNLPDocument

class CoreNLPDocument(sentences: Array[Sentence]) extends Document

User: mihais Date: 3/2/13

Companion:
object
Supertypes:
class Document
trait Serializable
class Object
trait Matchable
class Any

Value members

Concrete methods

def assimilate(document: CoreNLPDocument, textOpt: Option[String]): CoreNLPDocument
override def clear(): Unit
Definition Classes
Document
override def copy(sentences: Array[Sentence], textOpt: Option[String]): CoreNLPDocument
Definition Classes
Document

Inherited methods

def addAttachment(name: String, attachment: DocumentAttachment): Unit

Adds an attachment to the document's attachment map

Inherited from:
Document
def assimilate(document: Document, textOpt: Option[String]): Document
Inherited from:
Document

Used to compare Documents.

Returns:

a hash (Int) based primarily on the sentences, ignoring attachments

Inherited from:
Document
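
As a rough illustration of a comparison hash that depends on the sentences and ignores attachments, here is a minimal sketch. `SketchSentence`, `SketchDocument`, and `sketchHash` are hypothetical stand-ins invented for this example; they are not the library's types, and the library's actual hashing scheme may differ.

```scala
import scala.util.hashing.MurmurHash3

// Hypothetical, simplified stand-in for a sentence: just its words.
final case class SketchSentence(words: Vector[String])

// Hypothetical, simplified stand-in for a document with attachments.
final case class SketchDocument(
  sentences: Vector[SketchSentence],
  attachments: Map[String, String] = Map.empty
) {
  // Combines the per-sentence hashes in order.
  // The attachments field is deliberately left out of the computation.
  def sketchHash: Int =
    MurmurHash3.orderedHash(sentences.map(s => MurmurHash3.orderedHash(s.words)))
}
```

Under this sketch, two documents with the same sentences hash to the same value even when only one of them carries attachments.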
def getAttachment(name: String): Option[DocumentAttachment]

Retrieves the attachment with the given name

Inherited from:
Document
def getAttachmentKeys: Set[String]

Retrieves the keys of all attachments so that the entire collection can be read, for serialization among other purposes. If there are no attachments (that is, attachments == None), an empty set is returned. The method does not distinguish between None and Some(HashMap.empty); the latter should not occur because the attachment map is lazily initialized.

Inherited from:
Document
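
The attachment behavior described above (a map that stays None until the first attachment is added, and a key lookup that returns an empty set when nothing is attached) might be sketched as follows. `AttachmentStore`, `SketchAttachment`, and `NoteAttachment` are hypothetical names made up for this example; this is not the library's implementation of Document.

```scala
import scala.collection.mutable

// Hypothetical stand-in for the attachment type; not the library's trait.
trait SketchAttachment
final case class NoteAttachment(note: String) extends SketchAttachment

// Hypothetical container mirroring the documented attachment methods.
class AttachmentStore {
  // Stays None until the first attachment is added (lazy initialization).
  private var attachments: Option[mutable.HashMap[String, SketchAttachment]] = None

  def addAttachment(name: String, attachment: SketchAttachment): Unit = {
    if (attachments.isEmpty) attachments = Some(mutable.HashMap.empty)
    attachments.foreach(_.update(name, attachment))
  }

  def getAttachment(name: String): Option[SketchAttachment] =
    attachments.flatMap(_.get(name))

  def removeAttachment(name: String): Unit =
    attachments.foreach(_.remove(name))

  // Returns an empty set both when the map was never created
  // and when it exists but holds no entries.
  def getAttachmentKeys: Set[String] =
    attachments.map(_.keySet.toSet).getOrElse(Set.empty[String])
}
```

A caller that serializes attachments can iterate over getAttachmentKeys and fetch each entry with getAttachment, without needing to know whether the map was ever created.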
def getDCT: Option[String]
Inherited from:
Document
def offset(offset: Int): Document
Inherited from:
Document
def prettyPrint(pw: PrintWriter): Unit
Inherited from:
Document
def removeAttachment(name: String): Unit
Inherited from:
Document
def setDCT(dct: String): Unit

Sets the document creation time (DCT) using the CoreNLP format. See useFixedDate in the CoreNLP documentation for more details: https://stanfordnlp.github.io/CoreNLP/ner.html#setting-document-date. The DCT affects how Sentence.norms are generated for DATE expressions.

Value parameters:
dct

Document creation time

Inherited from:
Document
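
One way to build the string passed to setDCT is to format a java.time.LocalDate. The sketch below assumes that an ISO-style yyyy-MM-dd string is an acceptable DCT value; that assumption should be checked against the CoreNLP page linked above. `DctSketch` and `toDct` are hypothetical helper names, not part of the library.

```scala
import java.time.LocalDate
import java.time.format.DateTimeFormatter

object DctSketch {
  // Assumption: the DCT is given as an ISO-style yyyy-MM-dd string.
  // Verify the accepted format in the CoreNLP documentation before relying on this.
  def toDct(date: LocalDate): String =
    date.format(DateTimeFormatter.ISO_LOCAL_DATE)
}
```

With a document in hand, the call would then look like doc.setDCT(DctSketch.toDct(LocalDate.of(2013, 3, 2))), and getDCT would return the stored value wrapped in an Option.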

Concrete fields

var annotation: Option[Annotation]

Inherited fields

var coreferenceChains: Option[CorefChains]

Clusters of coreferent mentions

Inherited from:
Document
var id: Option[String]

Unique id for this document, if any

Inherited from:
Document
var text: Option[String]

The original text corresponding to this document, if it was preserved by the corresponding processor

Inherited from:
Document