Class TextCorpusStreamed

  • All Implemented Interfaces:
    TextCorpus, Closeable, AutoCloseable

    public class TextCorpusStreamed
    extends TextCorpusStored
    implements Closeable
    Class TextCorpusStreamed is used for accessing specified annotation layers and (optionally) adding any new annotation layers from/to TextCorpus. Only specified in the constructor annotation layers are loaded into the memory. In case all the annotation layers should be loaded into the memory, use WLData class.
    Author:
    Yana Panchenko
    • Constructor Detail

      • TextCorpusStreamed

        public TextCorpusStreamed​(InputStream inputStream,
                                  EnumSet<TextCorpusLayerTag> layersToRead)
                           throws WLFormatException
        Creates a TextCorpusStreamed from the given TCF input stream and specified annotation layers.
        Parameters:
        inputStream - the underlying input stream with linguistic annotations in TCF format.
        layersToRead - the annotation layers of TextCorpus that should be read into this TextCorpusStreamed.
        Throws:
        WLFormatException - if an error in input format or an I/O error occurs.
      • TextCorpusStreamed

        public TextCorpusStreamed​(InputStream inputStream,
                                  EnumSet<TextCorpusLayerTag> layersToRead,
                                  OutputStream outputStream)
                           throws WLFormatException
        Creates a TextCorpusStreamed from the given TCF input stream, specified annotation layers and the output stream.
        Parameters:
        inputStream - the underlying input stream with linguistic annotations in TCF format.
        layersToRead - the annotation layers of TextCorpus that should be read into this TextCorpusStreamed.
        outputStream - the underlying output stream into which the annotations from the input stream and any new created annotations will be written (in TCF format).
        Throws:
        WLFormatException - if an error in input format or an I/O error occurs.
      • TextCorpusStreamed

        public TextCorpusStreamed​(InputStream inputStream,
                                  EnumSet<TextCorpusLayerTag> layersToRead,
                                  OutputStream outputStream,
                                  boolean outputAsXmlFragment)
                           throws WLFormatException
        Creates a TextCorpusStreamed from the given TCF input stream, specified annotation layers and the output stream.
        Parameters:
        inputStream - the underlying input stream with linguistic annotations in TCF format.
        layersToRead - the annotation layers of TextCorpus that should be read into this TextCorpusStreamed.
        outputStream - the underlying output stream into which the annotations from the input stream and any new created annotations will be written (in TCF format).
        outputAsXmlFragment - true if the output should not contain xml headers, false otherwise.
        Throws:
        WLFormatException - if an error in input format or an I/O error occurs.
      • TextCorpusStreamed

        public TextCorpusStreamed​(InputStream inputStream,
                                  EnumSet<TextCorpusLayerTag> layersToRead,
                                  OutputStream outputStream,
                                  List<MetaDataItem> metaDataToAdd)
                           throws WLFormatException
        Creates a TextCorpusStreamed from the given TCF input stream, specified annotation layers, output stream and meta data.
        Parameters:
        inputStream - the underlying input stream with linguistic annotations in TCF format.
        layersToRead - the annotation layers of TextCorpus that should be read into this TextCorpusStreamed.
        outputStream - the underlying output stream into which the annotations from the input stream and any new created annotations will be written (in TCF format).
        metaDataToAdd - meta data to be added to the output TCF.
        Throws:
        WLFormatException - if an error in input format or an I/O error occurs.
    • Method Detail

      • close

        public void close()
                   throws WLFormatException
        Closes the input and output streams associated with this object and releases any associated system resources. Before the streams are closed, all in-memory annotations of the TextCorpusStreamed and not-processed part of the input stream are written to the output stream. Therefore, it's important to call close() method, so that all the in-memory annotations are saved to the output stream. Once the TextCorpusStreamed has been closed, adding further annotations will have no effect on the output stream.
        Specified by:
        close in interface AutoCloseable
        Specified by:
        close in interface Closeable
        Throws:
        WLFormatException - if an error in input format or an I/O error occurs.