Uploaded image for project: 'Xerces2-J'
  1. Xerces2-J
  2. XERCESJ-1156

MalformedURLException occurs when SYSTEMID value is korean character relative url

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Incomplete
    • 2.6.2, 2.8.0
    • None
    • None
    • None
    • Windows XP (korean edition), JDK 1.4, JDK 5.0, xerces 2.6.0, xerces 2.8.2

    Description

      If a xml file has a DOCTYPE and the SYSTEMID of DOCTYPE has a korean character relative url,
      MalformedException occurs during parsing.
      I guess that the same problem will occur if japanese or chinese character is used in SYSTEMID.

      This does not occur in JDK 1.4.2 with its internal crimson parser.
      I found that Xerces 2.6.2, Xerces 2.8.0 and JDK 1.5 have this problem.

      This is related to XERCESJ-391 which seems to be fixed over Xerces 2.6.2 in my test.

      The test case xml/dtd and patch for Xerces 2.6.2 will be uploaded.
      The patch is simple. The SYSTEMID value which has non-us-ascii character needs to be escaped.
      The escaping logic is from the XMLEntityManager#getUserDir().

      I think that xerces 2.8.0 can be patched in the same way.

      Attachments

        1. XMLEntityManager.patch.2
          2 kB
          Jongjin Choi
        2. XMLEntityManager.patch
          2 kB
          Jongjin Choi
        3. XERCESJ-1156.jar
          1 kB
          Jongjin Choi

        Activity

          People

            Unassigned Unassigned
            jongjinchoi Jongjin Choi
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: