Commons Imaging / IMAGING-174

Support non-8BIM signatures in Photoshop segments

Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • None
    • None
    • Format: JPEG
    • None

    Description

      IptcParser.parseAllBlocks(...) requires every block to carry an "8BIM" signature. However, we frequently encounter JPEG files that have "PHUT" signatures mixed in. Other sources also report "AgHg" and "DCSR" signatures, for example http://dev.exiv2.org/issues/800. Although the signature is not what the code expects, the block's data layout is the same as for 8BIM. Please consider either parsing such blocks or at least skipping them with a warning. Currently the code throws an exception, which prevents us from extracting any of the other metadata. I'm attaching a sample image from the Enron Corpus that contains two of these PHUT resource blocks.
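
      To illustrate the request, here is a minimal sketch of a lenient block scanner. It is not the Commons Imaging implementation: the class LenientBlockScanner, its Block type, and the accepted-signature list are hypothetical names made up for this example. It assumes the usual Photoshop image resource block layout (4-byte signature, 2-byte big-endian resource ID, Pascal-style name padded to an even length, 4-byte big-endian data size, data padded to an even length) and that the input starts at the first block, after any "Photoshop 3.0" segment header.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper for illustration only; not part of Commons Imaging.
final class LenientBlockScanner {

    // Signatures mentioned in this report. Assumption: these share the 8BIM layout.
    static final List<String> ACCEPTED = Arrays.asList("8BIM", "PHUT", "AgHg", "DCSR");

    static final class Block {
        final String signature;
        final int resourceId;
        final byte[] data;

        Block(String signature, int resourceId, byte[] data) {
            this.signature = signature;
            this.resourceId = resourceId;
            this.data = data;
        }
    }

    // Reads consecutive resource blocks; stops with a warning instead of throwing.
    static List<Block> scan(byte[] bytes) {
        List<Block> blocks = new ArrayList<>();
        int pos = 0;
        // Smallest possible block: 4 (signature) + 2 (ID) + 2 (empty name) + 4 (size).
        while (pos + 12 <= bytes.length) {
            String signature = new String(bytes, pos, 4, StandardCharsets.ISO_8859_1);
            if (!ACCEPTED.contains(signature)) {
                System.err.println("Warning: unexpected signature at offset " + pos);
                break;
            }
            int resourceId = ((bytes[pos + 4] & 0xFF) << 8) | (bytes[pos + 5] & 0xFF);

            // Name field: one length byte plus the name, rounded up to an even size.
            int nameLength = bytes[pos + 6] & 0xFF;
            int nameFieldSize = (1 + nameLength + 1) & ~1;
            int sizePos = pos + 6 + nameFieldSize;
            if (sizePos + 4 > bytes.length) {
                System.err.println("Warning: truncated block at offset " + pos);
                break;
            }

            long dataSize = ((bytes[sizePos] & 0xFFL) << 24)
                    | ((bytes[sizePos + 1] & 0xFFL) << 16)
                    | ((bytes[sizePos + 2] & 0xFFL) << 8)
                    | (bytes[sizePos + 3] & 0xFFL);
            int dataPos = sizePos + 4;
            if (dataSize > bytes.length - dataPos) {
                System.err.println("Warning: block data exceeds segment at offset " + pos);
                break;
            }

            byte[] data = Arrays.copyOfRange(bytes, dataPos, dataPos + (int) dataSize);
            blocks.add(new Block(signature, resourceId, data));

            // Block data is padded to an even number of bytes.
            pos = dataPos + (int) ((dataSize + 1) & ~1L);
        }
        return blocks;
    }

    public static void main(String[] args) {
        // One PHUT block with an empty name and two bytes of data.
        byte[] sample = {'P', 'H', 'U', 'T', 0x04, 0x04, 0, 0, 0, 0, 0, 2, 0x1C, 0x02};
        for (Block b : scan(sample)) {
            System.out.println(b.signature + " id=0x" + Integer.toHexString(b.resourceId)
                    + " bytes=" + b.data.length);
        }
    }
}
```

      With this approach a caller could still pick out the 8BIM blocks it understands and ignore or log the rest, rather than losing all metadata to an exception. Whether unknown signatures should be parsed or merely skipped is a design decision left to the maintainers.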

      Attachments

        1. friends.jpg
          73 kB
          Arjohn Kampman

        Activity

          People

            Assignee: Unassigned
            Reporter: Arjohn Kampman
            Votes: 0
            Watchers: 2
