public class XWPFWordExtractor
extends org.apache.poi.ooxml.extractor.POIXMLTextExtractor
Modifier and Type | Field and Description |
---|---|
static XWPFRelation[] |
SUPPORTED_TYPES |
Constructor and Description |
---|
XWPFWordExtractor(OPCPackage container) |
XWPFWordExtractor(XWPFDocument document) |
Modifier and Type | Method and Description |
---|---|
void |
appendBodyElementText(StringBuilder text,
IBodyElement e) |
void |
appendParagraphText(StringBuilder text,
XWPFParagraph paragraph) |
String |
getText()
Retrieves all the text from the document.
|
static void |
main(String[] args) |
void |
setConcatenatePhoneticRuns(boolean concatenatePhoneticRuns)
Should we concatenate phonetic runs in extraction.
|
void |
setFetchHyperlinks(boolean fetch)
Should we also fetch the hyperlinks, when fetching
the text content? Default is to only output the
hyperlink label, and not the contents
|
close, getCoreProperties, getCustomProperties, getDocument, getExtendedProperties, getMetadataTextExtractor, getPackage
setFilesystem
public static final XWPFRelation[] SUPPORTED_TYPES
public XWPFWordExtractor(OPCPackage container) throws XmlException, OpenXML4JException, IOException
public XWPFWordExtractor(XWPFDocument document)
public void setFetchHyperlinks(boolean fetch)
public void setConcatenatePhoneticRuns(boolean concatenatePhoneticRuns)
true
concatenatePhoneticRuns
- public String getText()
POITextExtractor
getText
in class POITextExtractor
public void appendBodyElementText(StringBuilder text, IBodyElement e)
public void appendParagraphText(StringBuilder text, XWPFParagraph paragraph)
Copyright © 2010 - 2020 Adobe. All Rights Reserved