  1. PDFBox
  2. PDFBOX-4860

Preflight doesn't catch repetition of elements in XMP

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.0.19
    • Fix Version/s: 2.0.21, 3.0.0 PDFBox
    • Component/s: Preflight, XmpBox
    • Labels:
      None

      Description

      Preflight doesn't report properties that occur more than once in the XMP metadata. The validator at https://www.pdf-online.com/osa/validate.aspx does:

      Validating file "Testdok-modified-003.pdf" for conformance level pdfa-1a
      Multiple occurrences of property 'pdf:Producer'.
      Multiple occurrences of property 'xmp:CreateDate'.
      Multiple occurrences of property 'xmp:CreatorTool'.
      The value of the key ToUnicode is invalid but must be compliant to the ToUnicode CMap format.
      The document does not conform to the requested standard.
      The document contains fonts without appropriate character to unicode mapping information (ToUnicode maps).
      The document's meta data is either missing or inconsistent or corrupt.
      The document does not conform to the PDF/A-1a standard.
      Done.

      This issue covers only the repeated XMP properties, not the ToUnicode finding.

       

      <?xpacket begin="" id="W5M0MpCehiHzreSzNTczkc9d"?><x:xmpmeta xmlns:x="adobe:ns:meta/">
        <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
          <rdf:Description xmlns:pdfaid="http://www.aiim.org/pdfa/ns/id/" rdf:about="">
            <pdfaid:part>1</pdfaid:part>
            <pdfaid:conformance>A</pdfaid:conformance>
          </rdf:Description>
          <rdf:Description xmlns:pdf="http://ns.adobe.com/pdf/1.3/" rdf:about="">
            <pdf:Producer>LibreOffice 5.4</pdf:Producer>
          </rdf:Description>
          <rdf:Description xmlns:xmp="http://ns.adobe.com/xap/1.0/" rdf:about="">
            <xmp:CreatorTool>Writer</xmp:CreatorTool>
            <xmp:CreateDate>2020-05-31T20:54:08+02:00</xmp:CreateDate>
          </rdf:Description>
          <rdf:Description xmlns:pdf="http://ns.adobe.com/pdf/1.3/" rdf:about="">
            <pdf:Keywords/>
            <pdf:Producer>LibreOffice 5.4</pdf:Producer>
          </rdf:Description>
          <rdf:Description xmlns:xmp="http://ns.adobe.com/xap/1.0/" rdf:about="">
            <xmp:ModifyDate>2020-06-03T08:38:15+02:00</xmp:ModifyDate>
            <xmp:CreateDate>2020-05-31T20:54:08+02:00</xmp:CreateDate>
            <xmp:CreatorTool>Writer</xmp:CreatorTool>
            <xmp:MetadataDate>2020-06-03T08:38:15+02:00</xmp:MetadataDate>
          </rdf:Description>
          <rdf:Description xmlns:dc="http://purl.org/dc/elements/1.1/" rdf:about="">
            <dc:title>
              <rdf:Alt>
                <rdf:li xml:lang="x-default">Default Title</rdf:li>
              </rdf:Alt>
            </dc:title>
            <dc:creator>
              <rdf:Seq>
                <rdf:li>PDFBox</rdf:li>
              </rdf:Seq>
            </dc:creator>
            <dc:description>
              <rdf:Alt>
                <rdf:li xml:lang="x-default">Default Subject</rdf:li>
              </rdf:Alt>
            </dc:description>
          </rdf:Description>
        </rdf:RDF>
      </x:xmpmeta><?xpacket end="w"?>
       

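A minimal sketch of how such a repetition check could look, using only the JDK's DOM parser. This is not the fix that went into Preflight or XmpBox; the class name `XmpDuplicateCheck` and the method `findDuplicateProperties` are hypothetical. It assumes that every top-level `rdf:Description` describes the same resource (all use `rdf:about=""` here) and it ignores properties written as attributes on `rdf:Description`.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.HashSet;
import java.util.LinkedHashSet;
import java.util.Set;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;

// Hypothetical helper, not part of PDFBox.
class XmpDuplicateCheck {
    static final String RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#";

    /**
     * Returns the properties that occur more than once across all top-level
     * rdf:Description elements, as "{namespaceURI}localName", in the order
     * in which the repetition is first seen.
     */
    static Set<String> findDuplicateProperties(String xmp) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        factory.setNamespaceAware(true); // needed to compare by namespace URI, not prefix
        Document doc = factory.newDocumentBuilder()
                .parse(new ByteArrayInputStream(xmp.getBytes(StandardCharsets.UTF_8)));

        Set<String> seen = new HashSet<>();
        Set<String> duplicates = new LinkedHashSet<>();

        NodeList descriptions = doc.getElementsByTagNameNS(RDF_NS, "Description");
        for (int i = 0; i < descriptions.getLength(); i++) {
            Node description = descriptions.item(i);
            Node parent = description.getParentNode();
            // Skip nested rdf:Description elements (structured values).
            if (parent == null
                    || !RDF_NS.equals(parent.getNamespaceURI())
                    || !"RDF".equals(parent.getLocalName())) {
                continue;
            }
            for (Node child = description.getFirstChild(); child != null;
                    child = child.getNextSibling()) {
                if (child.getNodeType() != Node.ELEMENT_NODE) {
                    continue; // whitespace, comments
                }
                String key = "{" + child.getNamespaceURI() + "}" + child.getLocalName();
                if (!seen.add(key)) {
                    duplicates.add(key); // second or later occurrence
                }
            }
        }
        return duplicates;
    }
}
```

Run against the packet above, this sketch should flag `Producer` in the `pdf` namespace and `CreateDate` and `CreatorTool` in the `xmp` namespace, the same three properties the online validator names.
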
        Attachments

        1. Testdok-modified-003.pdf
          21 kB
          Tilman Hausherr

            People

            • Assignee: Tilman Hausherr
            • Reporter: Tilman Hausherr