Package org.apache.poi.hwpf.extractor
Class Word6Extractor
- java.lang.Object
-
- org.apache.poi.extractor.POITextExtractor
-
- org.apache.poi.extractor.POIOLE2TextExtractor
-
- org.apache.poi.hwpf.extractor.Word6Extractor
-
- All Implemented Interfaces:
Closeable
,AutoCloseable
public final class Word6Extractor extends POIOLE2TextExtractor
Class to extract the text from old (Word 6 / Word 95) Word Documents. This should only be used on the older files, for most uses you should callWordExtractor
which deals properly with HWPF.
-
-
Constructor Summary
Constructors Constructor Description Word6Extractor(InputStream is)
Create a new Word ExtractorWord6Extractor(HWPFOldDocument doc)
Create a new Word ExtractorWord6Extractor(DirectoryNode dir)
Word6Extractor(DirectoryNode dir, POIFSFileSystem fs)
Deprecated.UseWord6Extractor(DirectoryNode)
insteadWord6Extractor(POIFSFileSystem fs)
Create a new Word Extractor
-
Method Summary
All Methods Instance Methods Concrete Methods Deprecated Methods Modifier and Type Method Description String[]
getParagraphText()
Deprecated.String
getText()
Retrieves all the text from the document.-
Methods inherited from class org.apache.poi.extractor.POIOLE2TextExtractor
getDocSummaryInformation, getDocument, getMetadataTextExtractor, getRoot, getSummaryInformation
-
Methods inherited from class org.apache.poi.extractor.POITextExtractor
close, setFilesystem
-
-
-
-
Constructor Detail
-
Word6Extractor
public Word6Extractor(InputStream is) throws IOException
Create a new Word Extractor- Parameters:
is
- InputStream containing the word file- Throws:
IOException
-
Word6Extractor
public Word6Extractor(POIFSFileSystem fs) throws IOException
Create a new Word Extractor- Parameters:
fs
- POIFSFileSystem containing the word file- Throws:
IOException
-
Word6Extractor
@Deprecated public Word6Extractor(DirectoryNode dir, POIFSFileSystem fs) throws IOException
Deprecated.UseWord6Extractor(DirectoryNode)
instead- Throws:
IOException
-
Word6Extractor
public Word6Extractor(DirectoryNode dir) throws IOException
- Throws:
IOException
-
Word6Extractor
public Word6Extractor(HWPFOldDocument doc)
Create a new Word Extractor- Parameters:
doc
- The HWPFOldDocument to extract from
-
-
Method Detail
-
getParagraphText
@Deprecated public String[] getParagraphText()
Deprecated.Get the text from the word file, as an array with one String per paragraph
-
getText
public String getText()
Description copied from class:POITextExtractor
Retrieves all the text from the document. How cells, paragraphs etc are separated in the text is implementation specific - see the javadocs for a specific project for details.- Specified by:
getText
in classPOITextExtractor
- Returns:
- All the text from the document
-
-