Class LibPstParser
- java.lang.Object
-
- org.apache.tika.parser.microsoft.libpst.LibPstParser
-
- All Implemented Interfaces:
Serializable,org.apache.tika.config.Initializable,org.apache.tika.parser.Parser
public class LibPstParser extends Object implements org.apache.tika.parser.Parser, org.apache.tika.config.Initializable
This is an optional PST parser that relies on the user installing the GPL-3 libpst/readpst commandline tool and configuring Tika to call this library via tika-config.xml- See Also:
- Serialized Form
-
-
Field Summary
Fields Modifier and Type Field Description static org.apache.tika.mime.MediaTypeMS_OUTLOOK_PST_MIMETYPE
-
Constructor Summary
Constructors Constructor Description LibPstParser()
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description voidcheckInitialization(org.apache.tika.config.InitializableProblemHandler initializableProblemHandler)booleancheckQuietly()Set<org.apache.tika.mime.MediaType>getSupportedTypes(org.apache.tika.parser.ParseContext parseContext)voidinitialize(Map<String,org.apache.tika.config.Param> map)voidparse(InputStream inputStream, ContentHandler contentHandler, org.apache.tika.metadata.Metadata metadata, org.apache.tika.parser.ParseContext parseContext)voidsetIncludeDeleted(boolean includeDeleted)voidsetMaxEmails(int maxEmails)voidsetProcessEmailAsMsg(boolean processEmailAsMsg)voidsetReadPstPath(String readPstPath)This should include the path up to but not including 'readpst', e.g.voidsetTimeoutSeconds(long timeoutSeconds)
-
-
-
Method Detail
-
getSupportedTypes
public Set<org.apache.tika.mime.MediaType> getSupportedTypes(org.apache.tika.parser.ParseContext parseContext)
- Specified by:
getSupportedTypesin interfaceorg.apache.tika.parser.Parser
-
parse
public void parse(InputStream inputStream, ContentHandler contentHandler, org.apache.tika.metadata.Metadata metadata, org.apache.tika.parser.ParseContext parseContext) throws IOException, SAXException, org.apache.tika.exception.TikaException
- Specified by:
parsein interfaceorg.apache.tika.parser.Parser- Throws:
IOExceptionSAXExceptionorg.apache.tika.exception.TikaException
-
initialize
public void initialize(Map<String,org.apache.tika.config.Param> map) throws org.apache.tika.exception.TikaConfigException
- Specified by:
initializein interfaceorg.apache.tika.config.Initializable- Throws:
org.apache.tika.exception.TikaConfigException
-
checkInitialization
public void checkInitialization(org.apache.tika.config.InitializableProblemHandler initializableProblemHandler) throws org.apache.tika.exception.TikaConfigException- Specified by:
checkInitializationin interfaceorg.apache.tika.config.Initializable- Throws:
org.apache.tika.exception.TikaConfigException
-
checkQuietly
public boolean checkQuietly()
-
setTimeoutSeconds
@Field public void setTimeoutSeconds(long timeoutSeconds)
-
setProcessEmailAsMsg
@Field public void setProcessEmailAsMsg(boolean processEmailAsMsg)
-
setIncludeDeleted
@Field public void setIncludeDeleted(boolean includeDeleted)
-
setMaxEmails
@Field public void setMaxEmails(int maxEmails)
-
setReadPstPath
@Field public void setReadPstPath(String readPstPath)
This should include the path up to but not including 'readpst', e.g. "C:\my_bin" where readpst is at "C:\my_bin\readpst"- Parameters:
readPstPath-
-
-