Uploaded image for project: 'Xerces-C++'
  1. Xerces-C++
  2. XERCESC-2065

Carriage return entities are not handled properly

VotersWatch issueWatchersLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    Description

      Documents with CR entities don't seem to round trip properly in the parser if you parse them and then serialize them. It's possible the bug is in the serializer because signed documents don't end up with corrupt signatures, but that may be due to insufficient testing as of yet.

      A simple example:

      <?xml version="1.0" encoding="UTF-8"?>
      <foo>
         text&#13;more&lt;&amp;
      </foo>
      

      Running that through DOMPrint or SAX2Print:

      <foo>
      more&lt;&amp;
      </foo>
      

      Notice the CR entity is removed, but also all of the characters immediately in front of it.

      Attachments

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            scantor Scott Cantor
            scantor Scott Cantor
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment