Package com.basistech.rosette.dm
Interface Annotator
- All Known Implementing Classes:
AbstractAnnotator
,ComposingAnnotator
,WholeDocumentLanguageDispatchAnnotator
public interface Annotator
An
Annotator
annotates text with attributes. Typical annotations
are tokens, script/language regions, morphological analyses, sentences,
base noun phrases, and entities. A single annotator operates in one
language; while the data model supports multi-language texts, this
interface does not (yet) provide support.-
Method Summary
Modifier and TypeMethodDescriptionannotate
(AnnotatedText input) Annotates an existing text object with additional attributes.annotate
(CharSequence input) Annotates raw text with attributes.
-
Method Details
-
annotate
Annotates raw text with attributes. For example, a base linguistics annotator may accept raw text and annotate it with token attributes.- Parameters:
input
- data to process- Returns:
- annotated data
-
annotate
Annotates an existing text object with additional attributes. For example, an entity annotator may take the output of a base linguistics annotator (which has token annotations) and add additional entities attributes.- Parameters:
input
- data to process- Returns:
- annotated data
-