Class OldExcelParser

  • All Implemented Interfaces:
    Serializable, org.apache.tika.parser.Parser

    public class OldExcelParser
    extends Object
    implements org.apache.tika.parser.Parser
    A POI-powered Tika Parser for very old versions of Excel, from pre-OLE2 days, such as Excel 4.
    See Also:
    Serialized Form
    • Constructor Detail

      • OldExcelParser

        public OldExcelParser()
    • Method Detail

      • parse

        protected static void parse​(org.apache.poi.hssf.extractor.OldExcelExtractor extractor,
                                    org.apache.tika.sax.XHTMLContentHandler xhtml)
                             throws org.apache.tika.exception.TikaException,
                                    IOException,
                                    SAXException
        Throws:
        org.apache.tika.exception.TikaException
        IOException
        SAXException
      • getSupportedTypes

        public Set<org.apache.tika.mime.MediaType> getSupportedTypes​(org.apache.tika.parser.ParseContext context)
        Specified by:
        getSupportedTypes in interface org.apache.tika.parser.Parser
      • parse

        public void parse​(InputStream stream,
                          ContentHandler handler,
                          org.apache.tika.metadata.Metadata metadata,
                          org.apache.tika.parser.ParseContext context)
                   throws IOException,
                          SAXException,
                          org.apache.tika.exception.TikaException
        Extracts properties and text from an MS Document input stream
        Specified by:
        parse in interface org.apache.tika.parser.Parser
        Throws:
        IOException
        SAXException
        org.apache.tika.exception.TikaException