Uploaded image for project: 'Xerces2-J'
  1. Xerces2-J
  2. XERCESJ-1094

Xerces in infinite loop validating wrongly encoded XML 1.1 documents

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Blocker
    • Resolution: Fixed
    • 2.7.1
    • 2.8.0
    • None
    • None
    • Linux, Solaris, jdk 1.2.2/1.4/1.5

    Description

      When parsing a XML1.1 document from an InputSource, where the encoding is set to iso-8859-1, with an encoding set to UTF-8 in the XML declaration, and with a iso-8859-2 character in an attribute, then xerces enters an infinite loop.
      If the same character is not in the attribute, then Xerces reports an invalid XML character instead of blocking.
      If the encoding of the input source is not set to iso-8859-1, Xerces works fine also.

      Sample doc and modified DocumentScanner that demonstrate the issue at http://jigsaw.w3.org/Yves/xercesBug.zip
      Thanks,

      Attachments

        Activity

          People

            mrglavas@ca.ibm.com Michael Glavassevich
            yves Yves Lafon
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: