Uploaded image for project: 'PDFBox'
  1. PDFBox
  2. PDFBOX-474

invalid xref entry causes parser to fail

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 0.8.0-incubator
    • 0.8.0-incubator
    • None
    • None

    Description

      I have a pdf doc with an invalid xref of the form

      xref

      55 117

      0000000016 00000 n

      0000048365 00000 n

      0000072064 00000 00 n

      0000110322 00000 n

      The problematic line is,

      0000072064 00000 00 n

      Which fails to parse with an exception of,

      Caused by: java.io.IOException: Corrupt XRefTable Entry - ObjID:79
      at org.apache.pdfbox.pdfparser.PDFParser.parseXrefTable(PDFParser.java:645)
      at org.apache.pdfbox.pdfparser.PDFParser.parseObject(PDFParser.java:414)
      at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:169)

      The pdf is invalid, but it would be nice to parse it.

      Attachments

        1. patch
          0.9 kB
          Sean Bridges

        Activity

          People

            Unassigned Unassigned
            sgbridges Sean Bridges
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: