Axis
  1. Axis
  2. AXIS-2342

Reopen issue: Character entities are escaped too aggressively

    Details

    • Type: Bug Bug
    • Status: Open
    • Resolution: Unresolved
    • Affects Version/s: 1.0
    • Fix Version/s: None
    • Labels:
      None
    • Environment:
      Operating System: All
      Platform: All

      Description

      We are using SOAP to send XML documents from client to server and back. The
      documents contain a lot of non-ASCII data. This is encoded as UTF-8 by us.
      However, when retrieved from an Axis server, Axis will escape almost all of our
      characters into character entities (so &#... This means messages become about
      three times as big as they have to for 'international' documents, which for us
      is a large performance problem. I narrowed down the problem to
      XMLUtils::xmlEncodeString
      that has the code:
      if (((int)chars[i]) > 127) {
      strBuf.append("&#");
      strBuf.append((int)chars[i]);
      strBuf.append(";");
      This seems unnecessary to me, as Axis will send all messages in UTF-8 anyway,
      for which no encoding is necessary (and should encoding be configurable, I feel
      this should be escaped elsewhere).

      Is there any reason for this code, I commented it out and it seemed to have no
      adverse effect on our application (apart from reduced network traffic)?

      Tested with 1.0, also looked up in the sources of 1.1-rc2.

      1. TESTCASE_2342.txt
        3 kB
        Christian Müller
      2. TEST_2342.diff
        5 kB
        Rodrigo Ruiz
      3. PATCH_2342.txt
        2 kB
        Christian Müller
      4. AXIS_2342.diff
        3 kB
        Rodrigo Ruiz

        Issue Links

          Activity

          No work has yet been logged on this issue.

            People

            • Assignee:
              Unassigned
              Reporter:
              Thiago Jung Bauermann
            • Votes:
              3 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

              • Created:
                Updated:

                Development