Uploaded image for project: 'PDFBox'
  1. PDFBox
  2. PDFBOX-159

Field renaming character set problem

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • 0.7.0
    • 1.8.9, 2.0.0
    • AcroForm
    • None

    Description

      [imported from SourceForge]
      http://sourceforge.net/tracker/index.php?group_id=78314&atid=552832&aid=1482334
      Originally submitted by mmajis on 2006-05-05 02:30.

      If I take an XFA PDF, remove the XFA form, rename one
      of the AcroForm fields and open the resulting PDF in
      Acrobat, the field name is garbled.

      To begin with, it seems the field name is UTF-16BE
      encoded. PDFBox writes the new /T entry in ISO-8859-1
      since it finds no character values larger than 255. if
      I change the source code to write UTF-16BE with the two
      prefix bytes 0xFF and 0xFE (not sure of the byte order
      here), the field name shows correctly in Acrobat.

      Attachments

        1. fieldNamesBefore.png
          6 kB
          Maruan Sahyoun
        2. fieldNamesAfter.png
          6 kB
          Maruan Sahyoun

        Activity

          People

            Unassigned Unassigned
            Anonymous Anonymous
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: