Uploaded image for project: 'PDFBox'
  1. PDFBox
  2. PDFBOX-806

Failure to extract dc:description when the value is the node text

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 1.2.1
    • 1.4.0
    • JempBox
    • None

    Description

      I can not get the dc:description from the following xmpmeta:
      <x:xmpmeta xmlns:x="adobe:ns:meta/" x:xmptk="XMP Core 4.4.0">
      <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
      <rdf:Description rdf:about=""
      xmlns:exif="http://ns.adobe.com/exif/1.0/">
      <exif:UserComment>exif</exif:UserComment>
      </rdf:Description>
      <rdf:Description rdf:about=""
      xmlns:dc="http://purl.org/dc/elements/1.1/">
      <dc:description>from XnViewMP 0.26</dc:description>
      </rdf:Description>
      </rdf:RDF>
      </x:xmpmeta>

      But it works from this:
      <x:xmpmeta xmlns:x="adobe:ns:meta/" x:xmptk="XMP Core 4.4.0-Exiv2">
      <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
      <rdf:Description rdf:about=""
      xmlns:dc="http://purl.org/dc/elements/1.1/">
      <dc:description>
      <rdf:Alt>
      <rdf:li xml:lang="x-default">dc:description</rdf:li>
      </rdf:Alt>
      </dc:description>
      </rdf:Description>
      </rdf:RDF>
      </x:xmpmeta>

      Both of them are valid according to http://www.w3.org/RDF/Validator/. My interpretation of http://dublincore.org/documents/dcmes-xml/ is that the first format should be ok for Dublin Core too.

      See attached unit test.

      Attachments

        1. JempboxTest.java
          2 kB
          Staffan Olsson
        2. PDFBOX-806.patch
          5 kB
          Staffan Olsson

        Activity

          People

            adamnichols Adam Nichols
            solsson Staffan Olsson
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: