Uploaded image for project: 'PDFBox'
  1. PDFBox
  2. PDFBOX-5161

Content stream parse error that doesn't happen when content stream is parsed alone

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 3.0.0 PDFBox
    • 3.0.0 PDFBox
    • Parsing

    Description

      java.io.IOException: Unknown dir object c=')' cInt=41 peek=')' peekInt=41 at offset 12287
          org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:865)
          org.apache.pdfbox.pdfparser.BaseParser.parseCOSArray(BaseParser.java:634)
          org.apache.pdfbox.pdfparser.PDFStreamParser.parseNextToken(PDFStreamParser.java:130)
      

      This code doesn't reproduce the problem:

              byte[] bytes = Files.readAllBytes(Paths.get("cs.txt"));
              PDFStreamParser parser = new PDFStreamParser(bytes);
              parser.parse();
      

      Attachments

        1. 179212.pdf
          207 kB
          Tilman Hausherr
        2. cs.txt
          23 kB
          Tilman Hausherr

        Activity

          People

            tilman Tilman Hausherr
            tilman Tilman Hausherr
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: