Interface TextStructureLayer

  • All Superinterfaces:
    TextCorpusLayer
    All Known Implementing Classes:
    TextStructureLayerStored

    public interface TextStructureLayer
    extends TextCorpusLayer
    The TextStructureLayer preserves the original structure of a written text. Within the layer, a token sequence can be annotated as belonging to some text structure element, such as to a page, a paragraph, a line, a title, etc. textspan element represents text structure annotation on a token sequence. The token sequence is specified by start token id and end token id. The type of the text structure element is specified by the corresponding attribute.
    Author:
    Yana Panchenko