Class WordMLParser

java.lang.Object
org.apache.tika.parser.microsoft.xml.AbstractXML2003Parser
org.apache.tika.parser.microsoft.xml.WordMLParser
All Implemented Interfaces:
Serializable, org.apache.tika.parser.Parser

public class WordMLParser extends AbstractXML2003Parser
Parses wordml 2003 format word files. These are single xml files that predate ooxml.

See https://en.wikipedia.org/wiki/Microsoft_Office_XML_formats

See Also:
  • Constructor Details

    • WordMLParser

      public WordMLParser()
  • Method Details