Class SXWPFWordExtractorDecorator
- java.lang.Object
-
- org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
-
- org.apache.tika.parser.microsoft.ooxml.SXWPFWordExtractorDecorator
-
- All Implemented Interfaces:
OOXMLExtractor
public class SXWPFWordExtractorDecorator extends AbstractOOXMLExtractor
This is an experimental, alternative extractor for docx files. This streams the main document content rather than loading the full document into memory.This will be better for some use cases than the classic docx extractor; and, it will be worse for others.
- Since:
- 1.15
-
-
Constructor Summary
Constructors Constructor Description SXWPFWordExtractorDecorator(Metadata metadata, ParseContext context, XWPFEventBasedWordExtractor extractor)
-
Method Summary
-
Methods inherited from class org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
getDocument, getMetadataExtractor, getXHTML
-
-
-
-
Constructor Detail
-
SXWPFWordExtractorDecorator
public SXWPFWordExtractorDecorator(Metadata metadata, ParseContext context, XWPFEventBasedWordExtractor extractor)
-
-