Class ApachePdfBoxDocumentParser

java.lang.Object
dev.langchain4j.data.document.parser.apache.pdfbox.ApachePdfBoxDocumentParser
All Implemented Interfaces:
dev.langchain4j.data.document.DocumentParser

public class ApachePdfBoxDocumentParser extends Object implements dev.langchain4j.data.document.DocumentParser
Parses PDF file into a Document using Apache PDFBox library
  • Constructor Details

    • ApachePdfBoxDocumentParser

      public ApachePdfBoxDocumentParser()
    • ApachePdfBoxDocumentParser

      public ApachePdfBoxDocumentParser(boolean includeMetadata)
  • Method Details

    • parse

      public dev.langchain4j.data.document.Document parse(InputStream inputStream)
      Specified by:
      parse in interface dev.langchain4j.data.document.DocumentParser