  1. PDFBox
  2. PDFBOX-473

attempt to push back when content read

Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.8.0-incubator
    • Fix Version/s: 0.8.0-incubator
    • Component/s: None
    • Labels: None

    Description

      I got this exception while parsing a PDF document:

      Caused by: org.apache.pdfbox.exceptions.WrappedIOException
      at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:228)
      ... 2 more
      Caused by: java.lang.ArrayIndexOutOfBoundsException
      at java.io.PushbackInputStream.unread(PushbackInputStream.java:218)
      at org.apache.pdfbox.io.PushBackInputStream.unread(PushBackInputStream.java:123)
      at org.apache.pdfbox.pdfparser.BaseParser.parseCOSString(BaseParser.java:493)
      at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:843)
      at org.apache.pdfbox.pdfparser.PDFParser.parseObject(PDFParser.java:485)
      at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:169)
      ... 3 more

      This change to BaseParser fixes it by calling unread only when at least one byte was actually read:

      Index: src/main/java/org/apache/pdfbox/pdfparser/BaseParser.java
      ===================================================================
      --- src/main/java/org/apache/pdfbox/pdfparser/BaseParser.java
      +++ src/main/java/org/apache/pdfbox/pdfparser/BaseParser.java
      @@ -492,7 +492,10 @@
       braces = 0;
       }
       }
      -pdfSource.unread( nextThreeBytes, 0, amountRead );
      +if(amountRead > 0)
      +{
      +    pdfSource.unread( nextThreeBytes, 0, amountRead );
      +}
       if( braces != 0 )
       {
       retval.append( ch );
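
The guard in the patch can be illustrated with a minimal standalone sketch. It assumes the failure occurs when the read returns -1 at end of stream and that value is then passed to unread as the length; the report does not state the actual value of amountRead. The class GuardedUnread and the method peekThreeBytes are hypothetical names, and java.io.PushbackInputStream is used directly in place of PDFBox's own PushBackInputStream wrapper.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.PushbackInputStream;

public class GuardedUnread {

    /**
     * Hypothetical helper: reads up to three bytes, then pushes back
     * only what was actually read. Returns the value reported by read().
     */
    public static int peekThreeBytes(PushbackInputStream in) throws IOException {
        byte[] nextThreeBytes = new byte[3];
        // read() returns -1 when the stream is already at its end
        int amountRead = in.read(nextThreeBytes);
        // Without this check, a negative length would reach unread()
        if (amountRead > 0) {
            in.unread(nextThreeBytes, 0, amountRead);
        }
        return amountRead;
    }

    public static void main(String[] args) throws IOException {
        // Empty source: nothing is read, so nothing is pushed back
        PushbackInputStream empty =
            new PushbackInputStream(new ByteArrayInputStream(new byte[0]), 3);
        System.out.println(peekThreeBytes(empty));

        // Two bytes available: both are read and then restored
        PushbackInputStream twoBytes =
            new PushbackInputStream(new ByteArrayInputStream(new byte[] {'a', 'b'}), 3);
        System.out.println(peekThreeBytes(twoBytes));
        System.out.println((char) twoBytes.read());
    }
}
```

The pushback buffer here is sized to three bytes so that a full three-byte peek can always be restored; the buffer size used by the PDFBox parser is not given in the report.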

      People

        Assignee: Unassigned
        Reporter: Sean Bridges (sgbridges)