Uploaded image for project: 'PDFBox'
  1. PDFBox
  2. PDFBOX-276

IOException on parsing a PDF file

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • None
    • 1.2.0
    • Parsing
    • None

    Description

      [imported from SourceForge]
      http://sourceforge.net/tracker/index.php?group_id=78314&atid=552832&aid=1722594
      Originally submitted by doublep-enw on 2007-05-21 05:10.

      When parsing the attached file, PDFBox throws the following exception:

      java.io.IOException: expected='/' actual='?'--1 org.pdfbox.io.PushBackInputStream@159f498
      at org.pdfbox.pdfparser.BaseParser.parseCOSName(BaseParser.java:774)
      at org.pdfbox.pdfparser.BaseParser.parseCOSDictionary(BaseParser.java:217)
      at org.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:910)
      at org.pdfbox.pdfparser.PDFParser.parseObject(PDFParser.java:432)
      at org.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:176)

      The file does look strange inside, but PDF viewers don't seem to care.

      [attachment on SourceForge]
      http://sourceforge.net/tracker/download.php?group_id=78314&atid=552832&aid=1722594&file_id=229983
      NotIndexedDocument.pdf (application/pdf), 8728 bytes
      unparseable file

      Attachments

        1. BaseParser.java
          45 kB
          Peter_Lenahan@ibi.com
        2. pdfbox-276-baseparser-patch-938120.txt
          4 kB
          Peter_Lenahan@ibi.com
        3. PDFBOX276-NotIndexedDocument.pdf
          9 kB
          Andreas Lehmkühler

        Issue Links

          Activity

            People

              Unassigned Unassigned
              Anonymous Anonymous
              Votes:
              2 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: