Package org.apache.tika.parser.external
Class ExternalParsersConfigReader
- java.lang.Object
-
- org.apache.tika.parser.external.ExternalParsersConfigReader
-
- All Implemented Interfaces:
ExternalParsersConfigReaderMetKeys
public final class ExternalParsersConfigReader extends java.lang.Object implements ExternalParsersConfigReaderMetKeys
Builds up ExternalParser instances based on XML file(s) which define what to run, for what, and how to process any output metadata. Typically used to configure up a series of external programs (like catdoc or pdf2txt) to extract text content from documents.TODO XML DTD Here
-
-
Field Summary
-
Fields inherited from interface org.apache.tika.parser.external.ExternalParsersConfigReaderMetKeys
CHECK_TAG, COMMAND_TAG, ERROR_CODES_TAG, EXTERNAL_PARSERS_TAG, METADATA_KEY_ATTR, METADATA_MATCH_TAG, METADATA_TAG, MIMETYPE_TAG, MIMETYPES_TAG, PARSER_TAG
-
-
Constructor Summary
Constructors Constructor Description ExternalParsersConfigReader()
-
Method Summary
All Methods Static Methods Concrete Methods Modifier and Type Method Description static java.util.List<ExternalParser>
read(java.io.InputStream stream)
static java.util.List<ExternalParser>
read(org.w3c.dom.Document document)
static java.util.List<ExternalParser>
read(org.w3c.dom.Element element)
-
-
-
Method Detail
-
read
public static java.util.List<ExternalParser> read(java.io.InputStream stream) throws TikaException, java.io.IOException
- Throws:
TikaException
java.io.IOException
-
read
public static java.util.List<ExternalParser> read(org.w3c.dom.Document document) throws TikaException, java.io.IOException
- Throws:
TikaException
java.io.IOException
-
read
public static java.util.List<ExternalParser> read(org.w3c.dom.Element element) throws TikaException, java.io.IOException
- Throws:
TikaException
java.io.IOException
-
-