Uploaded image for project: 'PDFBox'
  1. PDFBox
  2. PDFBOX-3646

Annotations parsed from XFDF containing ampersand characters are not properly imported

    Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 2.0.3, 2.0.4, 2.0.5, 2.0.6
    • Fix Version/s: None
    • Component/s: AcroForm, PDModel
    • Labels:
    • Environment:
      java 1.8.0_112

      Description

      Annotations containing "&" in their text are displayed incorrectly when parsed unmodified from XFDF (the ampersands are encoded as "&" there) and added to a PDF document.
      This occurs for both "text comment" and "text box" type annotations.
      However, if the XFDF is modified by replacing "&" with "&" prior to parsing, the imported annotations are then displayed correctly.

      The attached code produces two pdf files. One is the PDF with the unmodified XFDF imported, two the PDF with the modifed XFDF.

      A XFDF containing both a text box and text comment annotation is embedded in the source and attached as a separated file.

      Update 23.03.2017 : This problem persists in 2.0.5 and we noticed the same corruption of merged annotations occur, if the annotation text contains a "<" (encoded as "lt" entity)

        Attachments

        1. MergeTest.java
          4 kB
          Kai Keggenhoff
        2. sample.xfdf
          2 kB
          Kai Keggenhoff
        3. output2.pdf
          3 kB
          Kai Keggenhoff
        4. output1.pdf
          3 kB
          Kai Keggenhoff

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              k.keggenhoff Kai Keggenhoff
            • Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

              • Created:
                Updated: