Uploaded image for project: 'PDFBox'
  1. PDFBox
  2. PDFBOX-159

Field renaming character set problem

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 0.7.0
    • Fix Version/s: 1.8.9, 2.0.0
    • Component/s: AcroForm
    • Labels:
      None

      Description

      [imported from SourceForge]
      http://sourceforge.net/tracker/index.php?group_id=78314&atid=552832&aid=1482334
      Originally submitted by mmajis on 2006-05-05 02:30.

      If I take an XFA PDF, remove the XFA form, rename one
      of the AcroForm fields and open the resulting PDF in
      Acrobat, the field name is garbled.

      To begin with, it seems the field name is UTF-16BE
      encoded. PDFBox writes the new /T entry in ISO-8859-1
      since it finds no character values larger than 255. if
      I change the source code to write UTF-16BE with the two
      prefix bytes 0xFF and 0xFE (not sure of the byte order
      here), the field name shows correctly in Acrobat.

        Attachments

        1. fieldNamesAfter.png
          6 kB
          Maruan Sahyoun
        2. fieldNamesBefore.png
          6 kB
          Maruan Sahyoun

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              Anonymous
            • Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: