  1. Commons Imaging
  2. IMAGING-262

TIFF Parser ignores XMP Metadata if TagType is UNDEFINED


Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 1.0-alpha1
    • Fix Version/s: None
    • Component/s: Format: TIFF
    • Patch

    Description

      commons-imaging only extracts XMP metadata from a TIFF when the type field of the XMP IFD entry is BYTE. According to the Adobe XMP specification (https://www.adobe.com/devnet/xmp.html, Part 3 "Storage in Files", section 1.1.6 "TIFF", table 13), the type may be either UNDEFINED or BYTE, and some of the files we have been processing recently use UNDEFINED. Unfortunately, I cannot share the offending files because they are confidential.

      The attached patch changes commons-imaging to write BYTE, but to accept either BYTE or UNDEFINED when reading. It passes all existing unit tests, and we have been running it in production for a while without problems. The diff is against 1.0-alpha1, but it applies to master with line offsets.
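
For readers without access to the attachment, the read-side behaviour described above might look roughly like the following. This is a standalone sketch, not the attached patch and not the commons-imaging API: the class and method names are hypothetical, and only the numeric values (XMP tag 700, field type 1 for BYTE, 7 for UNDEFINED) come from the TIFF and XMP specifications. It also assumes the packet is UTF-8 encoded.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical illustration only; names are not taken from commons-imaging.
public class XmpFieldTypeSketch {
    static final int TIFF_TAG_XMP = 700;        // XMP packet tag
    static final int FIELD_TYPE_BYTE = 1;       // TIFF 6.0 field type BYTE
    static final int FIELD_TYPE_UNDEFINED = 7;  // TIFF 6.0 field type UNDEFINED

    // Readers accept both types that the XMP specification allows.
    static boolean isXmpFieldTypeAccepted(int fieldType) {
        return fieldType == FIELD_TYPE_BYTE || fieldType == FIELD_TYPE_UNDEFINED;
    }

    // Writers keep emitting BYTE, as the description above states.
    static int xmpFieldTypeForWriting() {
        return FIELD_TYPE_BYTE;
    }

    // Returns the XMP packet as a string, or null if this entry is not usable XMP.
    static String readXmp(int tag, int fieldType, byte[] value) {
        if (tag != TIFF_TAG_XMP || !isXmpFieldTypeAccepted(fieldType)) {
            return null;
        }
        // Assumption: the packet bytes are UTF-8.
        return new String(value, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        byte[] packet = "<x:xmpmeta/>".getBytes(StandardCharsets.UTF_8);
        System.out.println(readXmp(TIFF_TAG_XMP, FIELD_TYPE_BYTE, packet));
        System.out.println(readXmp(TIFF_TAG_XMP, FIELD_TYPE_UNDEFINED, packet));
    }
}
```

A reader that checked only for BYTE would return null for the second call, which matches the symptom reported here.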

      Please let me know if you need further information.

          People

            Assignee: Unassigned
            Reporter: Matthias Fratz (matthias.fratz)