[FOP-1969] Support for unicode Surrogate pairs - ASF JIRA

Details

Type: Improvement
Status: Resolved
Resolution: Fixed
Affects Version/s: 2.5
Fix Version/s: 2.3
Component/s: unqualified
Labels: None
Environment: Operating System: All, Platform: All
External issue ID: 51843

Description

Unicode code points outside the BMP (Basic Multilingual Plane), i.e., those whose scalar value is greater than 0xFFFF (65535), are encoded as UTF-16 surrogate pairs in Java strings. Each such pair should be treated as a single code point when mapping to a glyph in a font (one that supports extra-BMP mappings).

At present, FOP does not handle this case correctly in the simple (non-complex-script) rendering paths.

Furthermore, although some support has been added to handle this in the complex script rendering path, it has not yet been tested, so it is not necessarily working there either.
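
For illustration only, here is a minimal Java sketch of what treating a surrogate pair as a single code point means. It is not FOP code; the class and method names (SurrogatePairDemo, combine, codePoints) are hypothetical, and only standard java.lang and java.util calls are used. It assumes the text is walked by code point rather than by char before any glyph lookup.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical demo class, not part of FOP.
final class SurrogatePairDemo {

    /** Combines a high/low surrogate pair into one scalar value (UTF-16 decoding). */
    static int combine(char high, char low) {
        if (!Character.isSurrogatePair(high, low)) {
            throw new IllegalArgumentException("not a valid surrogate pair");
        }
        return 0x10000 + ((high - 0xD800) << 10) + (low - 0xDC00);
    }

    /** Returns the code points of text, consuming each surrogate pair as one unit. */
    static List<Integer> codePoints(String text) {
        List<Integer> result = new ArrayList<>();
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);   // reads both chars of a pair
            result.add(cp);
            i += Character.charCount(cp);   // advances by 1 or 2 chars
        }
        return result;
    }

    public static void main(String[] args) {
        String text = "A\uD83D\uDE00";      // 'A' followed by U+1F600
        System.out.println(text.length());              // number of UTF-16 chars
        System.out.println(codePoints(text).size());    // number of code points
        System.out.printf("U+%X%n", combine('\uD83D', '\uDE00'));
    }
}
```

A loop that indexes the string with charAt would see the two halves of the pair as separate characters, neither of which maps to a glyph on its own; iterating with codePointAt and Character.charCount avoids that.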

Command to reproduce:

fop test.fo -c fop.xconf out.pdf

Expected result: the glyphs should be rendered.

Attachments

testing.xsl (12/Jun/12 07:26, 0.9 kB, ngkit)
testing.pdf (12/Jun/12 07:29, 5 kB, ngkit)
testing.xml (12/Jun/12 07:31, 0.0 kB, ngkit)
testing.fo (12/Jun/12 08:30, 0.8 kB, ngkit)
testing.fo (12/Jun/12 08:32, 0.6 kB, ngkit)
testing.pdf (12/Jun/12 08:32, 5 kB, ngkit)
pcltest.zip (20/Sep/16 09:42, 0.7 kB, Simon Steiner)
Urdu.zip (20/Sep/16 09:47, 2 kB, Simon Steiner)
tiffttc.zip (20/Sep/16 10:06, 0.7 kB, Simon Steiner)
single-byte.zip (22/Sep/16 14:05, 2 kB, Simon Steiner)
fop.xconf (09/Mar/18 11:05, 0.7 kB, Simon Steiner)
test.fo (09/Mar/18 11:05, 0.7 kB, Simon Steiner)
AndroidEmoji.ttf (09/Mar/18 11:05, 438 kB, Simon Steiner)

Issue Links

relates to: FOP-2638 FOText.getScript() may prevent gsub/gpos application (Open)

links to: GitHub Pull Request #3

People

Assignee: Simon Steiner

Reporter: Glenn Adams

Votes: 6

Watchers: 14

Dates

Created: 19/Sep/11 01:35

Updated: 16/May/18 09:33

Resolved: 19/Mar/18 08:51