PDFBox / PDFBOX-4132

Unknown dir object c=')' cInt=41 peek=')' peekInt=41 at offset 2701

Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.0.8
    • Fix Version/s: 2.0.28, 3.0.0 PDFBox
    • Component/s: Parsing
    • Labels: None

    Description

      Parsing the attached document during text extraction throws "IOException: Unknown dir object...". Stack trace:

      java.io.IOException: Unknown dir object c=')' cInt=41 peek=')' peekInt=41 at offset 2701
          at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:959)
          at org.apache.pdfbox.pdfparser.BaseParser.parseCOSArray(BaseParser.java:631)
          at org.apache.pdfbox.pdfparser.PDFStreamParser.parseNextToken(PDFStreamParser.java:174)
          at org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:502)
          at org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:469)
          at org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(PDFStreamEngine.java:150)
          at org.apache.pdfbox.text.LegacyPDFStreamEngine.processPage(LegacyPDFStreamEngine.java:139)
          at org.apache.pdfbox.text.PDFTextStripper.processPage(PDFTextStripper.java:391)
          at org.apache.pdfbox.text.PDFTextStripper.processPages(PDFTextStripper.java:319)
          at org.apache.pdfbox.text.PDFTextStripper.writeText(PDFTextStripper.java:266)
          at org.apache.pdfbox.text.PDFTextStripper.getText(PDFTextStripper.java:227)
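
      The message text carries the details needed to locate the problem in the content stream: the character the parser stopped at (c, with its code cInt), the next character (peek, peekInt), and the stream offset. Here both characters are ')' (code 41) at offset 2701. The following is a minimal sketch for pulling those fields out of a logged message, for example when triaging many failing files. The class name UnknownDirObjectMessage and its fields are hypothetical and not part of PDFBox; the pattern is an assumption based only on the message quoted in this report.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not a PDFBox class: decodes the fields of an
// "Unknown dir object" message as it is quoted in this report.
public class UnknownDirObjectMessage {
    // Assumed message layout, copied from the exception text above.
    private static final Pattern MESSAGE = Pattern.compile(
            "Unknown dir object c='(.)' cInt=(\\d+) peek='(.)' peekInt=(-?\\d+) at offset (\\d+)",
            Pattern.DOTALL);

    final char current;     // character the parser could not interpret
    final int currentCode;  // its numeric code as printed in the message
    final char next;        // character reported as "peek"
    final int nextCode;     // numeric code reported as "peekInt"
    final long offset;      // reported position in the stream

    private UnknownDirObjectMessage(char current, int currentCode,
                                    char next, int nextCode, long offset) {
        this.current = current;
        this.currentCode = currentCode;
        this.next = next;
        this.nextCode = nextCode;
        this.offset = offset;
    }

    // Accepts either the bare message or a full log line containing it.
    static UnknownDirObjectMessage parse(String message) {
        Matcher m = MESSAGE.matcher(message);
        if (!m.find()) {
            throw new IllegalArgumentException("Not an 'Unknown dir object' message: " + message);
        }
        return new UnknownDirObjectMessage(
                m.group(1).charAt(0),
                Integer.parseInt(m.group(2)),
                m.group(3).charAt(0),
                Integer.parseInt(m.group(4)),
                Long.parseLong(m.group(5)));
    }

    public static void main(String[] args) {
        UnknownDirObjectMessage info = parse(
                "java.io.IOException: Unknown dir object c=')' cInt=41 peek=')' peekInt=41 at offset 2701");
        System.out.println(info.current + " " + info.currentCode + " " + info.offset);
    }
}
```

      With the offset in hand, the bytes around position 2701 of the decoded content stream can be inspected directly to see what the parser encountered inside the array.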

      Attachments

        1. buggy.pdf
          88 kB
          Martin Deutsch

          People

            Assignee: Andreas Lehmkühler (lehmi)
            Reporter: Martin Deutsch (sempfer)
            Votes: 0
            Watchers: 3