Details
-
Improvement
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
-
None
Description
Peter Wyatt recently published an article on UTF-8 strings in PDF 2.0: https://www.pdfa.org/understanding-utf-8-in-pdf-2-0/
The article includes a link to a test file he created: https://github.com/pdf-association/pdf20examples/blob/master/pdf20-utf8-test.pdf
Our debugger shows that we may need to add support for this (see attached). This was with PDFBox 2.0.25. I didn't have a chance to test with 3.x or the 2.x snapshot.
I don't think we're necessarily covering all the changes yet in PDF 2.0, but I thought I'd open this issue for at least discussion.