Uploaded image for project: 'PDFBox'
  1. PDFBox
  2. PDFBOX-1603

Regression in PDDocument.loadNonSeq ?

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 1.8.1
    • 1.8.2
    • None
    • None
    • jruby 1.7.3 (1.9.3p385) 2013-02-21 dac429b on Java HotSpot(TM) 64-Bit Server VM 1.6.0_45-b06-451-11M4406 [darwin-x86_64]

    Description

      Sometime ago I reported PDFBOX-1483, when I came across a PDF (attached to that issue) that couldn't be loaded with PDDocument.load but worked fine with PDDocument.loadNonSeq. The latter method worked with all the PDFs I tested.

      Now (PDFBox-2.0.0-SNAPSHOT, just built from source) PDDocument.loadNonSeq is failing for all the PDFs that were previously working.

      Sample traceback:

      Java::JavaIo::IOException: Object must be defined and must not be compressed object: 13:0
      org.apache.pdfbox.pdfparser.NonSequentialPDFParser.parseObjectDynamically(NonSequentialPDFParser.java:1115)
      org.apache.pdfbox.pdfparser.NonSequentialPDFParser.parseObjectDynamically(NonSequentialPDFParser.java:1078)
      org.apache.pdfbox.pdfparser.NonSequentialPDFParser.initialParse(NonSequentialPDFParser.java:343)
      org.apache.pdfbox.pdfparser.NonSequentialPDFParser.parse(NonSequentialPDFParser.java:657)
      org.apache.pdfbox.pdmodel.PDDocument.loadNonSeq(PDDocument.java:1245)
      org.apache.pdfbox.pdmodel.PDDocument.loadNonSeq(PDDocument.java:1228)

      Attachments

        1. gre.pdf
          51 kB
          Manuel Aristaran

        Issue Links

          Activity

            People

              lehmi Andreas Lehmkühler
              maristaran Manuel Aristaran
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: