Package org.apache.poi.hwpf.converter
Class WordToHtmlConverter
- java.lang.Object
-
- org.apache.poi.hwpf.converter.AbstractWordConverter
-
- org.apache.poi.hwpf.converter.WordToHtmlConverter
-
@Beta public class WordToHtmlConverter extends AbstractWordConverter
Converts Word files (95-2007) into HTML files.This implementation doesn't create images or links to them. This can be changed by overriding
AbstractWordConverter.processImage(Element, boolean, Picture)
method.
-
-
Field Summary
-
Fields inherited from class org.apache.poi.hwpf.converter.AbstractWordConverter
UNICODECHAR_NO_BREAK_SPACE, UNICODECHAR_NONBREAKING_HYPHEN, UNICODECHAR_ZERO_WIDTH_SPACE
-
-
Constructor Summary
Constructors Constructor Description WordToHtmlConverter(HtmlDocumentFacade htmlDocumentFacade)
WordToHtmlConverter(Document document)
Creates new instance of WordToHtmlConverter.
-
Method Summary
All Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description protected void
afterProcess()
Special actions that need to be called after processing complete, like updating stylesheets or building document notes list.Document
getDocument()
static void
main(String[] args)
Java main() interface to interact with WordToHtmlConverterprotected void
outputCharacters(Element pElement, CharacterRun characterRun, String text)
protected void
processBookmarks(HWPFDocumentCore wordDocument, Element currentBlock, Range range, int currentTableLevel, List<Bookmark> rangeBookmarks)
Wrap range into bookmark(s) and process it.protected void
processDocumentInformation(SummaryInformation summaryInformation)
void
processDocumentPart(HWPFDocumentCore wordDocument, Range range)
protected void
processDrawnObject(HWPFDocument doc, CharacterRun characterRun, OfficeDrawing officeDrawing, String path, Element block)
protected void
processDropDownList(Element block, CharacterRun characterRun, String[] values, int defaultIndex)
protected void
processEndnoteAutonumbered(HWPFDocument wordDocument, int noteIndex, Element block, Range endnoteTextRange)
protected void
processFootnoteAutonumbered(HWPFDocument wordDocument, int noteIndex, Element block, Range footnoteTextRange)
protected void
processHyperlink(HWPFDocumentCore wordDocument, Element currentBlock, Range textRange, int currentTableLevel, String hyperlink)
protected void
processImage(Element currentBlock, boolean inlined, Picture picture, String imageSourcePath)
protected void
processImageWithoutPicturesManager(Element currentBlock, boolean inlined, Picture picture)
protected void
processLineBreak(Element block, CharacterRun characterRun)
protected void
processNoteAutonumbered(HWPFDocument doc, String type, int noteIndex, Element block, Range noteTextRange)
protected void
processPageBreak(HWPFDocumentCore wordDocument, Element flow)
protected void
processPageref(HWPFDocumentCore hwpfDocument, Element currentBlock, Range textRange, int currentTableLevel, String pageref)
protected void
processParagraph(HWPFDocumentCore hwpfDocument, Element parentElement, int currentTableLevel, Paragraph paragraph, String bulletText)
protected void
processSection(HWPFDocumentCore wordDocument, Section section, int sectionCounter)
protected void
processSingleSection(HWPFDocumentCore wordDocument, Section section)
protected void
processTable(HWPFDocumentCore hwpfDocument, Element flow, Table table)
-
Methods inherited from class org.apache.poi.hwpf.converter.AbstractWordConverter
getCharacterRunTriplet, getFontReplacer, getNumberColumnsSpanned, getNumberRowsSpanned, getPicturesManager, processCharacters, processDeadField, processDocument, processDrawnObject, processField, processImage, processNoteAnchor, processOle2, processParagraphes, processSymbol, setFontReplacer, setPicturesManager, tryDeadField
-
-
-
-
Constructor Detail
-
WordToHtmlConverter
public WordToHtmlConverter(Document document)
Creates new instance of WordToHtmlConverter. Can be used for output severalHWPFDocument
s into single HTML document.- Parameters:
document
- XML DOM Document used as HTML document
-
WordToHtmlConverter
public WordToHtmlConverter(HtmlDocumentFacade htmlDocumentFacade)
-
-
Method Detail
-
main
public static void main(String[] args) throws IOException, ParserConfigurationException, TransformerException
Java main() interface to interact with WordToHtmlConverterUsage: WordToHtmlConverter infile outfile
Where infile is an input .doc file ( Word 95-2007) which will be rendered as HTML into outfile
-
afterProcess
protected void afterProcess()
Description copied from class:AbstractWordConverter
Special actions that need to be called after processing complete, like updating stylesheets or building document notes list. Usually they are called once, but it's okay to call them several times.- Overrides:
afterProcess
in classAbstractWordConverter
-
getDocument
public Document getDocument()
- Specified by:
getDocument
in classAbstractWordConverter
-
outputCharacters
protected void outputCharacters(Element pElement, CharacterRun characterRun, String text)
- Specified by:
outputCharacters
in classAbstractWordConverter
-
processBookmarks
protected void processBookmarks(HWPFDocumentCore wordDocument, Element currentBlock, Range range, int currentTableLevel, List<Bookmark> rangeBookmarks)
Description copied from class:AbstractWordConverter
Wrap range into bookmark(s) and process it. All bookmarks have starts equal to range start and ends equal to range end. Usually it's only one bookmark.- Specified by:
processBookmarks
in classAbstractWordConverter
-
processDocumentInformation
protected void processDocumentInformation(SummaryInformation summaryInformation)
- Specified by:
processDocumentInformation
in classAbstractWordConverter
-
processDocumentPart
public void processDocumentPart(HWPFDocumentCore wordDocument, Range range)
- Overrides:
processDocumentPart
in classAbstractWordConverter
-
processDropDownList
protected void processDropDownList(Element block, CharacterRun characterRun, String[] values, int defaultIndex)
- Overrides:
processDropDownList
in classAbstractWordConverter
-
processDrawnObject
protected void processDrawnObject(HWPFDocument doc, CharacterRun characterRun, OfficeDrawing officeDrawing, String path, Element block)
- Specified by:
processDrawnObject
in classAbstractWordConverter
-
processEndnoteAutonumbered
protected void processEndnoteAutonumbered(HWPFDocument wordDocument, int noteIndex, Element block, Range endnoteTextRange)
- Specified by:
processEndnoteAutonumbered
in classAbstractWordConverter
-
processFootnoteAutonumbered
protected void processFootnoteAutonumbered(HWPFDocument wordDocument, int noteIndex, Element block, Range footnoteTextRange)
- Specified by:
processFootnoteAutonumbered
in classAbstractWordConverter
-
processHyperlink
protected void processHyperlink(HWPFDocumentCore wordDocument, Element currentBlock, Range textRange, int currentTableLevel, String hyperlink)
- Specified by:
processHyperlink
in classAbstractWordConverter
-
processImage
protected void processImage(Element currentBlock, boolean inlined, Picture picture, String imageSourcePath)
- Specified by:
processImage
in classAbstractWordConverter
-
processImageWithoutPicturesManager
protected void processImageWithoutPicturesManager(Element currentBlock, boolean inlined, Picture picture)
- Specified by:
processImageWithoutPicturesManager
in classAbstractWordConverter
-
processLineBreak
protected void processLineBreak(Element block, CharacterRun characterRun)
- Specified by:
processLineBreak
in classAbstractWordConverter
-
processNoteAutonumbered
protected void processNoteAutonumbered(HWPFDocument doc, String type, int noteIndex, Element block, Range noteTextRange)
-
processPageBreak
protected void processPageBreak(HWPFDocumentCore wordDocument, Element flow)
- Specified by:
processPageBreak
in classAbstractWordConverter
-
processPageref
protected void processPageref(HWPFDocumentCore hwpfDocument, Element currentBlock, Range textRange, int currentTableLevel, String pageref)
- Specified by:
processPageref
in classAbstractWordConverter
-
processParagraph
protected void processParagraph(HWPFDocumentCore hwpfDocument, Element parentElement, int currentTableLevel, Paragraph paragraph, String bulletText)
- Specified by:
processParagraph
in classAbstractWordConverter
-
processSection
protected void processSection(HWPFDocumentCore wordDocument, Section section, int sectionCounter)
- Specified by:
processSection
in classAbstractWordConverter
-
processSingleSection
protected void processSingleSection(HWPFDocumentCore wordDocument, Section section)
- Overrides:
processSingleSection
in classAbstractWordConverter
-
processTable
protected void processTable(HWPFDocumentCore hwpfDocument, Element flow, Table table)
- Specified by:
processTable
in classAbstractWordConverter
-
-