Package | Description |
---|---|
org.apache.tika.fork |
Forked parser.
|
org.apache.tika.parser |
Tika parsers.
|
org.apache.tika.parser.external |
External parser process.
|
org.apache.tika.parser.multiple |
Modifier and Type | Class and Description |
---|---|
class |
ForkParser |
Modifier and Type | Class and Description |
---|---|
class |
AbstractEncodingDetectorParser
Abstract base class for parsers that use the AutoDetectReader and need
to use the
EncodingDetector configured by TikaConfig |
class |
AbstractExternalProcessParser
Abstract base class for parsers that call external processes.
|
class |
AutoDetectParser |
class |
CompositeParser
Composite parser that delegates parsing tasks to a component parser
based on the declared content type of the incoming document.
|
class |
CryptoParser
Decrypts the incoming document stream and delegates further parsing to
another parser instance.
|
class |
DefaultParser
A composite parser based on all the
Parser implementations
available through the
service provider mechanism . |
class |
DelegatingParser
Base class for parser implementations that want to delegate parts of the
task of parsing an input document to another parser.
|
class |
DigestingParser |
class |
EmptyParser
Dummy parser that always produces an empty XHTML document without even
attempting to parse the given document stream.
|
class |
ErrorParser
Dummy parser that always throws a
TikaException without even
attempting to parse the given document stream. |
class |
NetworkParser |
class |
ParserDecorator
Decorator base class for the
Parser interface. |
class |
ParserPostProcessor
Parser decorator that post-processes the results from a decorated parser.
|
class |
RecursiveParserWrapper
This is a helper class that wraps a parser in a recursive handler.
|
class |
StatefulParser
The RecursiveParserWrapper wraps the parser sent
into the parsecontext and then uses that parser
to store state (among many other things).
|
Modifier and Type | Class and Description |
---|---|
class |
CompositeExternalParser
A Composite Parser that wraps up all the available External Parsers,
and provides an easy way to access them.
|
class |
ExternalParser
Parser that uses an external program (like catdoc or pdf2txt) to extract
text content and metadata from a given document.
|
Modifier and Type | Class and Description |
---|---|
class |
AbstractMultipleParser
Abstract base class for parser wrappers which may / will
process a given stream multiple times, merging the results
of the various parsers used.
|
class |
FallbackParser
Tries multiple parsers in turn, until one succeeds.
|
class |
SupplementingParser
Runs the input stream through all available parsers,
merging the metadata from them based on the
AbstractMultipleParser.MetadataPolicy chosen. |
Copyright © 2007–2021 The Apache Software Foundation. All rights reserved.