Class ApachePoiDocumentParser

java.lang.Object
dev.langchain4j.data.document.parser.apache.poi.ApachePoiDocumentParser
All Implemented Interfaces:
dev.langchain4j.data.document.DocumentParser

public class ApachePoiDocumentParser extends Object implements dev.langchain4j.data.document.DocumentParser
Parses Microsoft Office file into a Document using Apache POI library. This parser supports various file formats, including doc, docx, ppt, pptx, xls, and xlsx. For detailed information on supported formats, please refer to the official Apache POI website.
  • Constructor Details

    • ApachePoiDocumentParser

      public ApachePoiDocumentParser()
  • Method Details

    • parse

      public dev.langchain4j.data.document.Document parse(InputStream inputStream)
      Specified by:
      parse in interface dev.langchain4j.data.document.DocumentParser