Class VisioTextExtractor

  • All Implemented Interfaces:
    java.io.Closeable, java.lang.AutoCloseable, POIOLE2TextExtractor, POITextExtractor

    public final class VisioTextExtractor
    extends java.lang.Object
    implements POIOLE2TextExtractor
    Class to find all the text in a Visio file, and return it. Can operate on the command line (outputs to stdout), or can return the text for you (example: for use with Lucene).
    • Constructor Detail

      • VisioTextExtractor

        public VisioTextExtractor​(HDGFDiagram hdgf)
      • VisioTextExtractor

        public VisioTextExtractor​(POIFSFileSystem fs)
                           throws java.io.IOException
        Throws:
        java.io.IOException
      • VisioTextExtractor

        public VisioTextExtractor​(DirectoryNode dir)
                           throws java.io.IOException
        Throws:
        java.io.IOException
      • VisioTextExtractor

        public VisioTextExtractor​(java.io.InputStream inp)
                           throws java.io.IOException
        Throws:
        java.io.IOException
    • Method Detail

      • getAllText

        public java.lang.String[] getAllText()
        Locates all the text entries in the file, and returns their contents.
        Returns:
        An array of each Text item in the document
      • getText

        public java.lang.String getText()
        Returns the textual contents of the file. Each textual object's text will be separated by a newline
        Specified by:
        getText in interface POITextExtractor
        Returns:
        All text contained in this document, separated by \n