Class PdfContentReaderTool


  • public class PdfContentReaderTool
    extends Object
    Tool that parses the content of a PDF document.
    Since:
    2.1.4
    • Constructor Detail

      • PdfContentReaderTool

        public PdfContentReaderTool()
    • Method Detail

      • getDictionaryDetail

        public static String getDictionaryDetail​(PdfDictionary dic)
        Shows the detail of a dictionary. This is similar to the PdfLister functionality.
        Parameters:
        dic - the dictionary of which you want the detail
        Returns:
        a String representation of the dictionary
      • getDictionaryDetail

        public static String getDictionaryDetail​(PdfDictionary dic,
                                                 int depth)
        Shows the detail of a dictionary.
        Parameters:
        dic - the dictionary of which you want the detail
        depth - the depth of the current dictionary (for nested dictionaries)
        Returns:
        a String representation of the dictionary
      • getXObjectDetail

        public static String getXObjectDetail​(PdfDictionary resourceDic)
                                       throws IOException
        Displays a summary of the entries in the XObject dictionary for the stream
        Parameters:
        resourceDic - the resource dictionary for the stream
        Returns:
        a string with the summary of the entries
        Throws:
        IOException
        Since:
        5.0.2
      • listContentStreamForPage

        public static void listContentStreamForPage​(PdfReader reader,
                                                    int pageNum,
                                                    PrintWriter out)
                                             throws IOException
        Writes information about a specific page from PdfReader to the specified output stream.
        Parameters:
        reader - the PdfReader to read the page content from
        pageNum - the page number to read
        out - the output stream to send the content to
        Throws:
        IOException
        Since:
        2.1.5
      • listContentStream

        public static void listContentStream​(File pdfFile,
                                             PrintWriter out)
                                      throws IOException
        Writes information about each page in a PDF file to the specified output stream.
        Parameters:
        pdfFile - a File instance referring to a PDF file
        out - the output stream to send the content to
        Throws:
        IOException
        Since:
        2.1.5
      • listContentStream

        public static void listContentStream​(File pdfFile,
                                             int pageNum,
                                             PrintWriter out)
                                      throws IOException
        Writes information about the specified page in a PDF file to the specified output stream.
        Parameters:
        pdfFile - a File instance referring to a PDF file
        pageNum - the page number to read
        out - the output stream to send the content to
        Throws:
        IOException
        Since:
        2.1.5
      • main

        public static void main​(String[] args)
        Writes information about each page in a PDF file to the specified file, or System.out.
        Parameters:
        args -