Fop
  1. Fop
  2. FOP-1969

Surrogate pairs not treated as single unicode codepoint for display purposes

    Details

    • Type: Improvement Improvement
    • Status: Open
    • Resolution: Unresolved
    • Affects Version/s: trunk
    • Fix Version/s: None
    • Component/s: unqualified
    • Labels:
      None
    • Environment:
      Operating System: All
      Platform: All
    • External issue ID:
      51843

      Description

      unicode codepoints outside of the BMP (base multilingual plane), i.e., whose scalar value is greater than 0xFFFF (65535), are coded as UTF-16 surrogate pairs in Java strings, which pair should be treated as a single codepoint for the purpose of mapping to a glyph in a font (that supports extra-BMP mappings);

      at present, FOP does not correctly handle this case in simple (non complex script) rendering paths;

      furthermore, though some support has been added to handle this in the complex script rendering path, it has not yet been tested, so is not necessarily working there either;

      1. testing.xsl
        0.9 kB
        ngkit
      2. testing.pdf
        5 kB
        ngkit
      3. testing.xml
        0.0 kB
        ngkit
      4. testing.fo
        0.8 kB
        ngkit
      5. testing.fo
        0.6 kB
        ngkit
      6. testing.pdf
        5 kB
        ngkit

        Activity

        No work has yet been logged on this issue.

          People

          • Assignee:
            Glenn Adams
            Reporter:
            Glenn Adams
          • Votes:
            2 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

            • Created:
              Updated:

              Development