Class XPSTextExtractor
- java.lang.Object
-
- org.apache.poi.extractor.POITextExtractor
-
- org.apache.poi.ooxml.extractor.POIXMLTextExtractor
-
- org.apache.tika.parser.microsoft.ooxml.xps.XPSTextExtractor
-
- All Implemented Interfaces:
Closeable
,AutoCloseable
public class XPSTextExtractor extends org.apache.poi.ooxml.extractor.POIXMLTextExtractor
Currently, mostly a pass-through class to hold pkg and properties and keep the general framework similar to our other POI-integrated extractors.
-
-
Constructor Summary
Constructors Constructor Description XPSTextExtractor(OPCPackage pkg)
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description POIXMLProperties.CoreProperties
getCoreProperties()
POIXMLProperties.CustomProperties
getCustomProperties()
POIXMLProperties.ExtendedProperties
getExtendedProperties()
OPCPackage
getPackage()
String
getText()
Retrieves all the text from the document.-
Methods inherited from class org.apache.poi.ooxml.extractor.POIXMLTextExtractor
close, getDocument, getMetadataTextExtractor
-
Methods inherited from class org.apache.poi.extractor.POITextExtractor
setFilesystem
-
-
-
-
Constructor Detail
-
XPSTextExtractor
public XPSTextExtractor(OPCPackage pkg) throws OpenXML4JException, XmlException, IOException
-
-
Method Detail
-
getPackage
public OPCPackage getPackage()
- Overrides:
getPackage
in classorg.apache.poi.ooxml.extractor.POIXMLTextExtractor
-
getText
public String getText()
Description copied from class:POITextExtractor
Retrieves all the text from the document. How cells, paragraphs etc are separated in the text is implementation specific - see the javadocs for a specific project for details.- Specified by:
getText
in classPOITextExtractor
- Returns:
- All the text from the document
-
getCoreProperties
public POIXMLProperties.CoreProperties getCoreProperties()
- Overrides:
getCoreProperties
in classorg.apache.poi.ooxml.extractor.POIXMLTextExtractor
-
getExtendedProperties
public POIXMLProperties.ExtendedProperties getExtendedProperties()
- Overrides:
getExtendedProperties
in classorg.apache.poi.ooxml.extractor.POIXMLTextExtractor
-
getCustomProperties
public POIXMLProperties.CustomProperties getCustomProperties()
- Overrides:
getCustomProperties
in classorg.apache.poi.ooxml.extractor.POIXMLTextExtractor
-
-