Class XWPFEventBasedWordExtractor
- java.lang.Object
-
- org.apache.poi.extractor.POITextExtractor
-
- org.apache.poi.ooxml.extractor.POIXMLTextExtractor
-
- org.apache.tika.parser.microsoft.ooxml.xwpf.XWPFEventBasedWordExtractor
-
- All Implemented Interfaces:
Closeable
,AutoCloseable
public class XWPFEventBasedWordExtractor extends org.apache.poi.ooxml.extractor.POIXMLTextExtractor
Experimental class that is based on POI's XSSFEventBasedExcelExtractor
-
-
Constructor Summary
Constructors Constructor Description XWPFEventBasedWordExtractor(String path)
XWPFEventBasedWordExtractor(OPCPackage container)
-
Method Summary
All Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description POIXMLProperties.CoreProperties
getCoreProperties()
POIXMLProperties.CustomProperties
getCustomProperties()
POIXMLProperties.ExtendedProperties
getExtendedProperties()
OPCPackage
getPackage()
String
getText()
Retrieves all the text from the document.static void
main(String[] args)
-
Methods inherited from class org.apache.poi.ooxml.extractor.POIXMLTextExtractor
close, getDocument, getMetadataTextExtractor
-
Methods inherited from class org.apache.poi.extractor.POITextExtractor
setFilesystem
-
-
-
-
Constructor Detail
-
XWPFEventBasedWordExtractor
public XWPFEventBasedWordExtractor(String path) throws XmlException, OpenXML4JException, IOException
-
XWPFEventBasedWordExtractor
public XWPFEventBasedWordExtractor(OPCPackage container) throws XmlException, OpenXML4JException, IOException
-
-
Method Detail
-
getPackage
public OPCPackage getPackage()
- Overrides:
getPackage
in classorg.apache.poi.ooxml.extractor.POIXMLTextExtractor
-
getCoreProperties
public POIXMLProperties.CoreProperties getCoreProperties()
- Overrides:
getCoreProperties
in classorg.apache.poi.ooxml.extractor.POIXMLTextExtractor
-
getExtendedProperties
public POIXMLProperties.ExtendedProperties getExtendedProperties()
- Overrides:
getExtendedProperties
in classorg.apache.poi.ooxml.extractor.POIXMLTextExtractor
-
getCustomProperties
public POIXMLProperties.CustomProperties getCustomProperties()
- Overrides:
getCustomProperties
in classorg.apache.poi.ooxml.extractor.POIXMLTextExtractor
-
getText
public String getText()
Description copied from class:POITextExtractor
Retrieves all the text from the document. How cells, paragraphs etc are separated in the text is implementation specific - see the javadocs for a specific project for details.- Specified by:
getText
in classPOITextExtractor
- Returns:
- All the text from the document
-
-