Uploaded image for project: 'PDFBox'
  1. PDFBox
  2. PDFBOX-4424

IOException when merging PDF documents containing URLs with unmatched brakets.

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Not A Bug
    • 2.0.9, 2.0.13
    • None
    • Parsing
    • None

    Description

      Steps to reproduce:

      1. Download the attached files to a directory.

      2. Refactor the constants in PdfBoxDefect.java to reference the downloaded files.

      3. Run PdfBoxDefect.java.

      4. Note, the error merging error:

      Merging /Users/paulslauenwhite/Downloads/Test_PDF_Working.pdf to /Users/paulslauenwhite/Downloads/Test_PDF_Working_MERGED.pdf.

      Merged /Users/paulslauenwhite/Downloads/Test_PDF_Working.pdf to /Users/paulslauenwhite/Downloads/Test_PDF_Working_MERGED.pdf.

      Merging /Users/paulslauenwhite/Downloads/Test_PDF_Not_Working.pdf to /Users/paulslauenwhite/Downloads/Test_PDF_Not_Working_MERGED.pdf.

      15:59:32,120 [main]  WARN org.apache.pdfbox.pdfparser.BaseParser              - Invalid dictionary, found: 'Æ' but expected: '/' at offset 13199

      15:59:32,120 [main]  WARN org.apache.pdfbox.pdfparser.BaseParser              - Bad dictionary declaration at offset 13224

      15:59:32,120 [main]  WARN org.apache.pdfbox.pdfparser.BaseParser              - Invalid dictionary, found: 'R' but expected: '/' at offset 13224

      15:59:32,120 [main]  WARN org.apache.pdfbox.pdfparser.BaseParser              - Bad dictionary declaration at offset 13314

      15:59:32,120 [main]  WARN org.apache.pdfbox.pdfparser.BaseParser              - Invalid dictionary, found: '_' but expected: '/' at offset 13314

      15:59:32,120 [main]  WARN org.apache.pdfbox.pdfparser.BaseParser              - Corrupt object reference at offset 13574

      15:59:32,120 [main]  WARN org.apache.pdfbox.pdfparser.BaseParser              - Corrupt object reference at offset 13588

      java.io.IOException: Unknown dir object c=')' cInt=41 peek=')' peekInt=41 at offset 13588

      at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:961)

      at org.apache.pdfbox.pdfparser.BaseParser.parseCOSArray(BaseParser.java:631)

      at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:874)

      at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryValue(BaseParser.java:152)

      at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryNameValuePair(BaseParser.java:279)

      at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionary(BaseParser.java:212)

      at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:862)

      at org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:852)

      at org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:821)

      at org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:741)

      at org.apache.pdfbox.pdfparser.COSParser.parseDictObjects(COSParser.java:701)

      at org.apache.pdfbox.pdfparser.PDFParser.initialParse(PDFParser.java:205)

      at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:240)

      at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1144)

      at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1060)

      at org.apache.pdfbox.multipdf.PDFMergerUtility.mergeDocuments(PDFMergerUtility.java:261)

      at org.apache.pdfbox.multipdf.PDFMergerUtility.mergeDocuments(PDFMergerUtility.java:231)

      at PdfBoxDefect.merge(PdfBoxDefect.java:73)

      at PdfBoxDefect.main(PdfBoxDefect.java:48)

       

      Attachments

        1. PdfBoxDefect.java
          3 kB
          Paul Slauenwhite
        2. Test_PDF_Not_Working.pdf
          19 kB
          Paul Slauenwhite
        3. Test_PDF_Working.pdf
          19 kB
          Paul Slauenwhite

        Activity

          People

            Unassigned Unassigned
            paulslauenwhite Paul Slauenwhite
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: