Axis / AXIS-2342

Reopen issue: Character entities are escaped too aggressively

Details

    • Bug
    • Status: Open
    • Resolution: Unresolved
    • 1.0
    • None
    • None
    • Operating System: All
      Platform: All
    • 19327

    Description

      We use SOAP to send XML documents from client to server and back. The
      documents contain a lot of non-ASCII data, which we encode as UTF-8.
      However, when the documents are retrieved from an Axis server, Axis escapes
      almost all of our characters into character entities (so &#...). This means
      messages become about three times as big as they have to be for
      'international' documents, which is a large performance problem for us. I
      narrowed the problem down to XMLUtils::xmlEncodeString, which contains the
      code:
      if (((int)chars[i]) > 127) {
          strBuf.append("&#");
          strBuf.append((int)chars[i]);
          strBuf.append(";");
      }
      This seems unnecessary to me, as Axis sends all messages in UTF-8 anyway,
      for which no such escaping is necessary (and if the encoding should be
      configurable, I feel the escaping should be done elsewhere).
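
The alternative suggested above can be sketched as follows. This is only an illustration under the assumption that the output is always serialized as UTF-8; `MinimalXmlEscaper` and `escape` are hypothetical names, not part of Axis, and the sketch does not handle control characters that are illegal in XML. It escapes only the characters that are significant to XML markup and passes everything above 127 through unchanged.

```java
// Hypothetical helper for illustration; this is NOT the Axis XMLUtils code.
final class MinimalXmlEscaper {

    // Escapes only markup-significant characters. Non-ASCII characters are
    // left as-is, on the assumption that the document is written out as UTF-8.
    static String escape(String input) {
        StringBuilder out = new StringBuilder(input.length());
        for (int i = 0; i < input.length(); i++) {
            char c = input.charAt(i);
            switch (c) {
                case '&':
                    out.append("&amp;");
                    break;
                case '<':
                    out.append("&lt;");
                    break;
                case '>':
                    out.append("&gt;");
                    break;
                case '"':
                    out.append("&quot;");
                    break;
                default:
                    // Includes every character above 127: no numeric reference.
                    out.append(c);
            }
        }
        return out.toString();
    }
}
```

With this sketch, `MinimalXmlEscaper.escape("Müller & Söhne")` returns `Müller &amp; Söhne`, so the umlauts reach the wire as two UTF-8 bytes each instead of a six-character reference.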

      Is there any reason for this code? I commented it out and it seemed to have
      no adverse effect on our application (apart from reduced network traffic).
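
To make the size difference concrete, here is a small, self-contained sketch that compares the UTF-8 byte count of a string with and without numeric escaping. `EntitySizeDemo`, `escapeNonAscii`, and `utf8Length` are hypothetical names chosen for this example; `escapeNonAscii` only mirrors the check quoted in the report and is not the full Axis method.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical demo class; it only mirrors the "> 127" check quoted above.
final class EntitySizeDemo {

    // Replaces every char above 127 with a decimal character reference,
    // working on UTF-16 code units just like the quoted snippet.
    static String escapeNonAscii(String input) {
        StringBuilder out = new StringBuilder();
        for (int i = 0; i < input.length(); i++) {
            char c = input.charAt(i);
            if (c > 127) {
                out.append("&#").append((int) c).append(';');
            } else {
                out.append(c);
            }
        }
        return out.toString();
    }

    // Number of bytes the string occupies when encoded as UTF-8.
    static int utf8Length(String s) {
        return s.getBytes(StandardCharsets.UTF_8).length;
    }

    public static void main(String[] args) {
        String greek = "\u03b1\u03b2\u03b3"; // three Greek letters: alpha, beta, gamma
        System.out.println(utf8Length(greek));                 // prints 6
        System.out.println(utf8Length(escapeNonAscii(greek))); // prints 18
    }
}
```

For text made up entirely of two-byte characters, as in this example, the escaped form is three times the size of the raw UTF-8, which is consistent with the factor observed above. Mixed text with many ASCII characters grows less.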

      Tested with 1.0; the same code is also present in the sources of 1.1-rc2.

      Attachments

        1. AXIS_2342.diff
          3 kB
          Rodrigo Ruiz
        2. PATCH_2342.txt
          2 kB
          Christian Müller
        3. TEST_2342.diff
          5 kB
          Rodrigo Ruiz
        4. TESTCASE_2342.txt
          3 kB
          Christian Müller

            People

              Assignee: Unassigned
              Reporter: Thiago Jung Bauermann (bauermann)
              Votes: 4
              Watchers: 6