Xerces-C++ / XERCESC-2065

Carriage return entities are not handled properly

Details

    Description

      Documents containing CR character references (&#13;) do not round-trip properly: parsing such a document and then serializing it changes the content. The bug may be in the serializer rather than the parser, because signed documents don't end up with corrupt signatures, though that could simply reflect insufficient testing so far.

      A simple example:

      <?xml version="1.0" encoding="UTF-8"?>
      <foo>
         text&#13;more&lt;&amp;
      </foo>
      

      Running that through DOMPrint or SAX2Print produces:

      <foo>
      more&lt;&amp;
      </foo>
      

      Note that the CR character reference is removed, along with all of the characters immediately before it on that line.
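
      For context: under XML 1.0 end-of-line handling, a literal CR in a document is read back as LF, so a CR in character data only survives a round trip if the serializer writes it as a character reference. The following is a minimal sketch of that escaping, using only the standard library. The names escapeText and showControlChars are hypothetical helpers, not part of Xerces-C++, and this is not the content of the attached patch.

```cpp
#include <string>

// Hypothetical helper (not a Xerces-C++ API): escape character data for
// output so that a carriage return survives a later parse.
std::string escapeText(const std::string& text) {
    std::string out;
    out.reserve(text.size());
    for (char c : text) {
        switch (c) {
            case '&':  out += "&amp;"; break;
            case '<':  out += "&lt;";  break;
            case '>':  out += "&gt;";  break;
            // A raw CR would be normalized to LF when the output is parsed
            // again, so it is written as a character reference instead.
            case '\r': out += "&#xD;"; break;
            default:   out += c;       break;
        }
    }
    return out;
}

// Hypothetical helper: render CR and LF as visible escapes so serialized
// output can be inspected without a terminal interpreting them.
std::string showControlChars(const std::string& bytes) {
    std::string out;
    for (char c : bytes) {
        if (c == '\r')      out += "\\r";
        else if (c == '\n') out += "\\n";
        else                out += c;
    }
    return out;
}
```

      Passing the serialized output through something like showControlChars may help tell whether the characters before the CR are really missing from the output, or only hidden because a terminal rendered a raw CR. That second possibility is an assumption, not something confirmed in this report.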

      Attachments

        1. xercesc-2065.patch
          0.9 kB
          Scott Cantor

          People

            scantor Scott Cantor