Class OldExcelExtractor

  • All Implemented Interfaces:
    java.io.Closeable, java.lang.AutoCloseable, POITextExtractor

    public class OldExcelExtractor
    extends java.lang.Object
    implements POITextExtractor
    A text extractor for Excel files that are too old for HSSFWorkbook to handle. This includes Excel 95 files and very old (pre-OLE2) Excel files, such as Excel 4 files.

    Returns much (but not all) of the textual content of the file. The output is suitable for indexing by something like Apache Lucene, or for use by Apache Tika, but is not really intended for display to the user.
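
The description above distinguishes OLE2-based files (such as Excel 95) from pre-OLE2 files (such as Excel 4). The following is a minimal sketch of how a caller might tell the two apart by inspecting the leading bytes of a file. It uses only the JDK and is not taken from this class; the class name `ExcelContainerSniffer`, the `Kind` enum, and both methods are hypothetical names introduced for illustration. It assumes that an OLE2 file begins with the compound document signature `D0 CF 11 E0 A1 B1 1A E1`, and that a pre-OLE2 workbook begins directly with a BOF record whose little-endian identifier is `0x0009`, `0x0209`, or `0x0409` (BIFF2 through BIFF4).

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.Arrays;

// Hypothetical helper, not part of Apache POI.
final class ExcelContainerSniffer {

    /** Coarse classification of the first bytes of a candidate Excel file. */
    enum Kind { OLE2, RAW_BIFF, UNKNOWN }

    // Signature at the start of every OLE2 compound document.
    private static final int[] OLE2_MAGIC =
            {0xD0, 0xCF, 0x11, 0xE0, 0xA1, 0xB1, 0x1A, 0xE1};

    /** Classifies a file from its leading bytes (8 bytes are enough). */
    static Kind classify(byte[] head) {
        if (head.length >= OLE2_MAGIC.length) {
            boolean ole2 = true;
            for (int i = 0; i < OLE2_MAGIC.length; i++) {
                if ((head[i] & 0xFF) != OLE2_MAGIC[i]) {
                    ole2 = false;
                    break;
                }
            }
            if (ole2) {
                return Kind.OLE2;
            }
        }
        // Assumption: a pre-OLE2 file starts with a BOF record, stored
        // little-endian as 09 00 (BIFF2), 09 02 (BIFF3) or 09 04 (BIFF4).
        if (head.length >= 2 && (head[0] & 0xFF) == 0x09) {
            int high = head[1] & 0xFF;
            if (high == 0x00 || high == 0x02 || high == 0x04) {
                return Kind.RAW_BIFF;
            }
        }
        return Kind.UNKNOWN;
    }

    /** Reads up to 8 bytes from the stream and classifies them. */
    static Kind sniff(InputStream in) throws IOException {
        byte[] buf = new byte[OLE2_MAGIC.length];
        int filled = 0;
        while (filled < buf.length) {
            int n = in.read(buf, filled, buf.length - filled);
            if (n < 0) {
                break; // stream ended before 8 bytes were available
            }
            filled += n;
        }
        return classify(Arrays.copyOf(buf, filled));
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java ExcelContainerSniffer <file>");
            return;
        }
        try (InputStream in = Files.newInputStream(Paths.get(args[0]))) {
            System.out.println(sniff(in));
        }
    }
}
```

A check like this is only a heuristic: an OLE2 signature says nothing about which Excel version is stored inside the container, and `sniff` consumes the bytes it reads, so a caller would need to reopen or buffer the stream before handing it to an extractor.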

    • Constructor Detail

      • OldExcelExtractor

        public OldExcelExtractor(java.io.InputStream input)
                          throws java.io.IOException
        Throws:
        java.io.IOException
      • OldExcelExtractor

        public OldExcelExtractor(java.io.File f)
                          throws java.io.IOException
        Throws:
        java.io.IOException
      • OldExcelExtractor

        public OldExcelExtractor(POIFSFileSystem fs)
                          throws java.io.IOException
        Throws:
        java.io.IOException
      • OldExcelExtractor

        public OldExcelExtractor(DirectoryNode directory)
                          throws java.io.IOException
        Throws:
        java.io.IOException