[PDFBOX-283] Character encoding/appearance issues when filling forms - ASF JIRA

Details

Type: Bug
Status: Closed
Priority: Critical
Resolution: Fixed
Affects Version/s: 2.0.0
Fix Version/s: 2.0.0
Component/s: AcroForm
Labels: None

Description

[imported from SourceForge]
http://sourceforge.net/tracker/index.php?group_id=78314&atid=552832&aid=1735902
Originally submitted by scop on 2007-06-12 10:23.

When filling a text field with non-ASCII characters, such as in my surname "Skyttä", and saving the document in a UTF-8 environment, something goes wrong with the appearance of the text.

The value itself seems to be stored correctly, but when the document is opened, "ä" is rendered as two garbage characters, which is what happens when UTF-8 is mistakenly treated as ISO-8859-1.
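The symptom described above can be reproduced outside PDFBox. The following is a minimal sketch, not code from PDFBox: the class name `MojibakeDemo` and the method `misdecode` are hypothetical names chosen for illustration. It encodes a string as UTF-8 and then decodes the bytes as ISO-8859-1, which turns each non-ASCII character into two garbage characters.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical demo class, not part of PDFBox.
public class MojibakeDemo {

    // Encode the text as UTF-8, then (incorrectly) decode the bytes as ISO-8859-1.
    static String misdecode(String text) {
        byte[] utf8 = text.getBytes(StandardCharsets.UTF_8);
        return new String(utf8, StandardCharsets.ISO_8859_1);
    }

    public static void main(String[] args) {
        // \u00e4 is "ä"; the escape keeps this source file independent of its own encoding.
        String original = "Skytt\u00e4";
        String garbled = misdecode(original);

        // "ä" occupies two bytes in UTF-8, so the garbled string is one character longer.
        System.out.println(original.length()); // 6
        System.out.println(garbled.length());  // 7
    }
}
```

The two extra characters are U+00C3 and U+00A4, which is the "Ã¤" pattern typical of this kind of mismatch.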

PDAppearance uses the platform default encoding in quite a few places, which apparently has the potential to mess things up. In particular, insertGeneratedAppearance() creates a PrintWriter from an OutputStream without specifying the encoding. In fact, if I hack that to use ISO-8859-1, the appearance of my "ä" case is correct, but that obviously won't work for any characters that are not valid ISO-8859-1.
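To illustrate the difference between a default-encoding PrintWriter and one with an explicit charset, here is a minimal sketch using only the JDK. It is not the PDFBox implementation or the eventual fix; the class `ExplicitCharsetWriter` and its methods `write` and `fits` are hypothetical names. It also shows why the ISO-8859-1 hack only covers part of the problem: characters outside that charset cannot be encoded.

```java
import java.io.ByteArrayOutputStream;
import java.io.OutputStreamWriter;
import java.io.PrintWriter;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical helper class, not part of PDFBox.
public class ExplicitCharsetWriter {

    // Write text through a PrintWriter whose charset is stated explicitly,
    // instead of relying on the platform default encoding.
    static byte[] write(String text, Charset charset) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        PrintWriter writer = new PrintWriter(new OutputStreamWriter(out, charset));
        writer.print(text);
        writer.flush();
        return out.toByteArray();
    }

    // Report whether every character of the text can be encoded in the charset.
    static boolean fits(String text, Charset charset) {
        return charset.newEncoder().canEncode(text);
    }

    public static void main(String[] args) {
        String a = "\u00e4"; // "ä"

        // One byte (0xE4) in ISO-8859-1, two bytes (0xC3 0xA4) in UTF-8.
        System.out.println(write(a, StandardCharsets.ISO_8859_1).length); // 1
        System.out.println(write(a, StandardCharsets.UTF_8).length);      // 2

        // The euro sign (U+20AC) has no ISO-8859-1 mapping.
        System.out.println(fits("\u20ac", StandardCharsets.ISO_8859_1));  // false
    }
}
```

When a character does not fit, OutputStreamWriter silently substitutes the charset's replacement byte rather than failing, so a check like `fits` is one way to detect the problem before writing.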

In which character encoding should the value be written to the appearance stream (at the end of insertGeneratedAppearance())?

Attachments

acroform.pdf, 464 kB, 09/Jul/14 10:00, Marco Primiceri
PDAppearance_bis.diff, 2 kB, 08/Jul/14 19:32, Marco Primiceri
PDAppearance.diff, 2 kB, 08/Jul/14 19:32, Marco Primiceri
PDAppearance.patch, 0.8 kB, 13/Sep/13 13:07, Maruan Sahyoun

Issue Links

depends upon:

PDFBOX-922 True type PDFont subclass only supports WinAnsiEncoding (hardcoded!) (Closed)

PDFBOX-2333 Overhaul the appearance generation for PDF forms (Closed)

People

Assignee: Unassigned

Reporter: Anonymous

Votes: 3

Watchers: 7

Dates

Created: 12/Jun/07 17:23

Updated: 17/Mar/16 19:08

Resolved: 12/Dec/14 22:10