  1. PDFBox
  2. PDFBOX-2715

Pages in a PDF being dropped with just an error-log message

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.8.8
    • Fix Version/s: 1.8.8, 2.0.0
    • Component/s: PDModel
    • Labels:
      None
    • Environment:
      Linux, Java 7.

      Description

      I'm trying to extract pages from PDF documents like this:

      PDDocument doc = PDDocument.load(new ByteArrayInputStream(pdf));
      List allPages = doc.getDocumentCatalog().getAllPages();
      

      However, not all pages get read, and the only indication that something is wrong is this error log message:

      ERROR org.apache.pdfbox.pdmodel.PDPageNode.getAllKids()#202: No Kids found in getAllKids(). Probably a malformed pdf.
      

      I'm getting one of these error-lines for each page that isn't read. I'm attaching two different files with this problem. One gives me 4 out of 6 pages, and the other gives me none of the 4 pages. Both documents read fine in Acrobat Reader and in Okular where all the pages get shown.
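      Until the page tree is read correctly, a caller could at least turn the silent drop into a hard failure by comparing the page count the document declares with the number of pages actually returned. The following is only a minimal sketch of that idea: `PageCountCheck`, `missingPages` and `requireAllPages` are hypothetical names, not part of PDFBox, and the sketch assumes the declared count of the page tree root can be trusted for these files. The PDFBox 1.8.x calls that would presumably supply the two inputs appear only in a comment, so the block itself has no dependency on the library.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not part of PDFBox.
final class PageCountCheck {

    // Number of pages the document declares but the reader did not return.
    static long missingPages(long declaredCount, List<?> readPages) {
        return Math.max(0L, declaredCount - readPages.size());
    }

    // Fail loudly instead of relying on an error-log line.
    static void requireAllPages(long declaredCount, List<?> readPages) {
        long missing = missingPages(declaredCount, readPages);
        if (missing > 0) {
            throw new IllegalStateException(
                missing + " of " + declaredCount + " pages were not read");
        }
    }

    public static void main(String[] args) {
        // Assumption: with PDFBox 1.8.x the inputs would come from something like
        //   long declared = doc.getDocumentCatalog().getPages().getCount();
        //   List allPages = doc.getDocumentCatalog().getAllPages();
        // Stand-in values mirroring the first attachment (4 of 6 pages read):
        List<String> readPages = Arrays.asList("p1", "p2", "p3", "p4");
        System.out.println(missingPages(6, readPages)); // prints 2
    }
}
```

      With a check like this, the first attached file would be rejected with "2 of 6 pages were not read" rather than yielding a truncated page list, provided the declared count in that file really is 6.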

        Attachments

        1. IT-11557_pdf_broken_pages_F150317DYCELZZ.pdf
          440 kB
          Cecilie Fritzvold
        2. IT-11557_pdf_broken_pages.pdf
          803 kB
          Cecilie Fritzvold

            People

            • Assignee:
              lehmi Andreas Lehmkühler
              Reporter:
              Cecilie Cecilie Fritzvold