Package org.apache.tika.parser
Tika parsers.
Class descriptions

Abstract base class for parsers that use the AutoDetectReader and need to use the EncodingDetector configured by TikaConfig.
Abstract base class for parsers that call external processes.
Abstract base class for new parsers.
Factory for an AutoDetectParser.
Composite parser that delegates parsing tasks to a component parser based on the declared content type of the incoming document.
Decrypts the incoming document stream and delegates further parsing to another parser instance.
A composite parser based on all the Parser implementations available through the service provider mechanism.
Base class for parser implementations that want to delegate parts of the task of parsing an input document to another parser.
Interface for digester.
Encodes byte array from a MessageDigest to String.
Dummy parser that always produces an empty XHTML document without even attempting to parse the given document stream.
Dummy parser that always throws a TikaException without even attempting to parse the given document stream.
Parse context.
Tika parser interface.
Decorator base class for the Parser interface.
Parser decorator that post-processes the results from a decorated parser.
Reader for the text content from a given binary stream.
Interface for providing a password to a Parser for handling encrypted and password-protected documents.
Helper class that wraps a parser in a recursive handler.
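
One entry in this package is described as encoding a byte array from a MessageDigest to a String. The following is a minimal sketch of that idea using only the JDK; it is not Tika's own code. The class name DigestHexSketch and the methods toHex and sha256Hex are hypothetical names chosen for illustration, and the choice of SHA-256 with lower-case hex output is an assumption.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

// Hypothetical illustration class; not part of org.apache.tika.parser.
final class DigestHexSketch {

    // Encode raw digest bytes as a lower-case hexadecimal string,
    // two characters per byte.
    static String toHex(byte[] bytes) {
        StringBuilder sb = new StringBuilder(bytes.length * 2);
        for (byte b : bytes) {
            sb.append(Character.forDigit((b >> 4) & 0xF, 16)); // high nibble
            sb.append(Character.forDigit(b & 0xF, 16));        // low nibble
        }
        return sb.toString();
    }

    // Compute a SHA-256 digest of the input and return it as hex.
    // SHA-256 is an assumed algorithm choice for this sketch.
    static String sha256Hex(byte[] data) {
        try {
            MessageDigest md = MessageDigest.getInstance("SHA-256");
            return toHex(md.digest(data));
        } catch (NoSuchAlgorithmException e) {
            // Every Java platform is required to provide SHA-256.
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        byte[] input = "abc".getBytes(StandardCharsets.UTF_8);
        System.out.println(sha256Hex(input));
    }
}
```

Keeping the encoding step separate from the digest computation mirrors the split the package summary suggests between a digester and an encoder, so a different text encoding could be swapped in without touching the hashing code.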