Xerces2-J / XERCESJ-1383

Adding Unicode Normalization support to Xerces2-J

Details

    • New Feature
    • Status: In Progress
    • Major
    • Resolution: Unresolved
    • 2.9.1
    • None
    • DOM (Level 3 Core), SAX
    • None
    • All

    Description

      This feature adds support for Unicode character normalization and normalization checking to Xerces. Applications that use Xerces will be able to produce fully normalized XML documents and to verify that the XML documents they process are fully normalized.
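
      As a rough illustration of what normalization and normalization checking mean at the string level, the sketch below uses the JDK's java.text.Normalizer rather than anything from Xerces or the attached CharacterNormalizer.zip. The class name NormalizationSketch and its helper methods are hypothetical. It covers only Normalization Form C; the XML 1.1 definition of "fully normalized" places further constraints on where composing characters may appear, which this sketch does not check.

```java
import java.text.Normalizer;

// Hypothetical helper class for illustration only; not part of Xerces.
final class NormalizationSketch {

    // Converts text to Unicode Normalization Form C (canonical composition).
    static String toNfc(String text) {
        return Normalizer.normalize(text, Normalizer.Form.NFC);
    }

    // Reports whether text is already in Normalization Form C.
    static boolean isNfc(String text) {
        return Normalizer.isNormalized(text, Normalizer.Form.NFC);
    }

    public static void main(String[] args) {
        // LATIN SMALL LETTER E followed by COMBINING ACUTE ACCENT
        String decomposed = "e\u0301";
        String composed = toNfc(decomposed);

        System.out.println("input already NFC:  " + isNfc(decomposed));
        System.out.println("output already NFC: " + isNfc(composed));
        System.out.println("output length:      " + composed.length());
    }
}
```

      A checking feature corresponds to calling something like isNfc and reporting an error on failure, while a normalizing feature corresponds to applying something like toNfc to character data before it is handed on.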

      Adding this functionality will allow Xerces to follow the character normalization provisions of the W3C XML 1.1 Recommendation and to implement the optional character normalization and normalization checking features specified in DOM Level 3 Core and SAX2.

      More specifically, the features to be implemented are:
      DOM Level 3 Core: "normalize-characters" [1]
      DOM Level 3 Core: "check-character-normalization" [2]
      SAX2: "unicode-normalization-checking" [3]

      [1] http://www.w3.org/TR/DOM-Level-3-Core/core.html#parameter-normalize-characters
      [2] http://www.w3.org/TR/DOM-Level-3-Core/core.html#parameter-check-character-normalization
      [3] http://www.saxproject.org/apidoc/org/xml/sax/package-summary.html
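
      An application could probe for the three features listed above roughly as follows. This is a minimal sketch using only the standard JAXP, DOM Level 3 and SAX2 interfaces; the class name NormalizationFeatureProbe and its methods are hypothetical, and the results depend entirely on the parser implementation found on the classpath. A parser without this support is expected to report the DOM parameters as not settable to true and to reject the SAX2 feature.

```java
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;
import javax.xml.parsers.SAXParserFactory;
import org.w3c.dom.Document;
import org.xml.sax.SAXException;
import org.xml.sax.SAXNotRecognizedException;
import org.xml.sax.SAXNotSupportedException;
import org.xml.sax.XMLReader;

// Hypothetical probe class for illustration only; not part of Xerces.
final class NormalizationFeatureProbe {

    // Standard SAX2 feature identifier for normalization checking.
    static final String SAX_FEATURE =
            "http://xml.org/sax/features/unicode-normalization-checking";

    // Returns true if the default SAX parser accepts the feature set to true.
    static boolean saxCheckingSupported() {
        try {
            XMLReader reader =
                    SAXParserFactory.newInstance().newSAXParser().getXMLReader();
            reader.setFeature(SAX_FEATURE, true);
            return true;
        } catch (SAXNotRecognizedException | SAXNotSupportedException e) {
            // The parser does not know the feature or cannot enable it.
            return false;
        } catch (ParserConfigurationException | SAXException e) {
            throw new IllegalStateException("Could not create a SAX parser", e);
        }
    }

    // Returns true if the default DOM implementation reports that the named
    // DOMConfiguration parameter can be set to true.
    static boolean domParameterSupported(String name) {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder().newDocument();
            return doc.getDomConfig().canSetParameter(name, Boolean.TRUE);
        } catch (ParserConfigurationException e) {
            throw new IllegalStateException("Could not create a DOM builder", e);
        }
    }

    public static void main(String[] args) {
        System.out.println("normalize-characters: "
                + domParameterSupported("normalize-characters"));
        System.out.println("check-character-normalization: "
                + domParameterSupported("check-character-normalization"));
        System.out.println("unicode-normalization-checking: "
                + saxCheckingSupported());
    }
}
```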

      Attachments

        1. CharacterNormalizer.zip
          501 kB
          Richard Kelly
        2. test.xml
          1 kB
          Richard Kelly

          People

            mrglavas@ca.ibm.com Michael Glavassevich
            rakkie Richard Kelly
