Class PdfContentReaderTool


  • public class PdfContentReaderTool
    extends Object
    Tool that parses the content of a PDF document.
    Since:
    2.1.4
    • Constructor Detail

      • PdfContentReaderTool

        public PdfContentReaderTool()
    • Method Detail

      • getDictionaryDetail

        public static String getDictionaryDetail​(PdfDictionary dic)
        Shows the detail of a dictionary. This is similar to the PdfLister functionality.
        Parameters:
        dic - the dictionary of which you want the detail
        Returns:
        a String representation of the dictionary
      • getDictionaryDetail

        public static String getDictionaryDetail​(PdfDictionary dic,
                                                 int depth)
        Shows the detail of a dictionary.
        Parameters:
        dic - the dictionary of which you want the detail
        depth - the depth of the current dictionary (for nested dictionaries)
        Returns:
        a String representation of the dictionary
      • listContentStreamForPage

        public static void listContentStreamForPage​(PdfReader reader,
                                                    int pageNum,
                                                    PrintWriter out)
                                             throws IOException
        Writes information about a specific page from PdfReader to the specified output stream.
        Parameters:
        reader - the PdfReader to read the page content from
        pageNum - the page number to read
        out - the output stream to send the content to
        Throws:
        IOException - thrown when an I/O operation goes wrong
        Since:
        2.1.5
      • listContentStream

        public static void listContentStream​(File pdfFile,
                                             PrintWriter out)
                                      throws IOException
        Writes information about each page in a PDF file to the specified output stream.
        Parameters:
        pdfFile - a File instance referring to a PDF file
        out - the output stream to send the content to
        Throws:
        IOException - thrown when an I/O operation goes wrong
        Since:
        2.1.5
      • listContentStream

        public static void listContentStream​(File pdfFile,
                                             int pageNum,
                                             PrintWriter out)
                                      throws IOException
        Writes information about the specified page in a PDF file to the specified output stream.
        Parameters:
        pdfFile - a File instance referring to a PDF file
        pageNum - the page number to read
        out - the output stream to send the content to
        Throws:
        IOException - thrown when an I/O operation goes wrong
        Since:
        2.1.5
      • main

        public static void main​(String[] args)
        Writes information about each page in a PDF file to the specified file, or System.out.
        Parameters:
        args - the arguments passed to the command line