Class OldExcelParser

java.lang.Object
org.apache.tika.parser.microsoft.OldExcelParser
All Implemented Interfaces:
Serializable, org.apache.tika.parser.Parser

public class OldExcelParser extends Object implements org.apache.tika.parser.Parser
A POI-powered Tika Parser for very old versions of Excel, from pre-OLE2 days, such as Excel 4.
See Also:
  • Constructor Summary

    Constructors
    Constructor
    Description
     
  • Method Summary

    Modifier and Type
    Method
    Description
    Set<org.apache.tika.mime.MediaType>
    getSupportedTypes(org.apache.tika.parser.ParseContext context)
     
    void
    parse(InputStream stream, ContentHandler handler, org.apache.tika.metadata.Metadata metadata, org.apache.tika.parser.ParseContext context)
    Extracts properties and text from an MS Document input stream
    protected static void
    parse(org.apache.poi.hssf.extractor.OldExcelExtractor extractor, org.apache.tika.sax.XHTMLContentHandler xhtml)
     

    Methods inherited from class java.lang.Object

    clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
  • Constructor Details

    • OldExcelParser

      public OldExcelParser()
  • Method Details

    • parse

      protected static void parse(org.apache.poi.hssf.extractor.OldExcelExtractor extractor, org.apache.tika.sax.XHTMLContentHandler xhtml) throws org.apache.tika.exception.TikaException, IOException, SAXException
      Throws:
      org.apache.tika.exception.TikaException
      IOException
      SAXException
    • getSupportedTypes

      public Set<org.apache.tika.mime.MediaType> getSupportedTypes(org.apache.tika.parser.ParseContext context)
      Specified by:
      getSupportedTypes in interface org.apache.tika.parser.Parser
    • parse

      public void parse(InputStream stream, ContentHandler handler, org.apache.tika.metadata.Metadata metadata, org.apache.tika.parser.ParseContext context) throws IOException, SAXException, org.apache.tika.exception.TikaException
      Extracts properties and text from an MS Document input stream
      Specified by:
      parse in interface org.apache.tika.parser.Parser
      Throws:
      IOException
      SAXException
      org.apache.tika.exception.TikaException