Commons Digester
  1. Commons Digester
  2. DIGESTER-124

xml attribute values containing "]" character incorrect after 7th occurrence

    Details

    • Type: Bug Bug
    • Status: Closed
    • Priority: Major Major
    • Resolution: Invalid
    • Affects Version/s: 1.8
    • Fix Version/s: None
    • Labels:
      None
    • Environment:

      Red Hat Enterprise Linux Client release 5.1.
      Linux 2.6.18-53.1.19.el5 #1 SMP Tue Apr 22 03:01:10 EDT 2008 x86_64 x86_64 x86_64 GNU/Linux
      java jdk1.5.0_11

      Description

      While configuring a program with regex patterns to inventory filenames to a database, I discovered that after 6 attributes containing the "]" character to a single element, the values are not read reliably.

      this digester_conf.xml
      <toplevel>
      <stringset
      attr1="1]"
      attr2="2]"
      attr3="3]"
      attr4="4]"
      attr5="5]"
      attr6="6]"
      attr7="7]seven"
      attr8="8]!"
      attr9="9]"
      attr10="ten]"
      />
      </toplevel>

      results in output (attr7 has attr8 overlaid, attr9 has attr10 overlaid):
      String set:
      attr1 =1]
      attr2 =2]
      attr3 =3]
      attr4 =4]
      attr5 =5]
      attr6 =6]
      attr7 =8]!even
      attr8 =8]!
      attr9 =te
      attr10=ten]

      I used maven 2 to create an executable jar file. The same executable jar run on MacOS 10.4.11, java version "1.5.0_13" gives correct output.

      This indicates to me that the error may be in some layer lower than digester, but I think it will take a digester developer to isolate the component at fault and file a bug report on the lower level component.

      1. TryParser.java
        1 kB
        Simon Kitching
      2. tryDigesterConfigLimit.tgz
        4 kB
        Ken Tanaka

        Activity

        Hide
        Ken Tanaka added a comment -

        Example code to illustrate the problem. This is a maven 2 project for linux.

        The assembly-assembly.sh is a shortcut command for compiling:
        mvn clean assembly:assembly

        The run.pl script is a shortcut to typing the execution command:
        java -jar target/tryDigesterConfigLimit-1.0-SNAPSHOT-jar-with-dependencies.jar src/main/config/digester_conf.xml src/main/config/log4j_conf.xml

        Show
        Ken Tanaka added a comment - Example code to illustrate the problem. This is a maven 2 project for linux. The assembly-assembly.sh is a shortcut command for compiling: mvn clean assembly:assembly The run.pl script is a shortcut to typing the execution command: java -jar target/tryDigesterConfigLimit-1.0-SNAPSHOT-jar-with-dependencies.jar src/main/config/digester_conf.xml src/main/config/log4j_conf.xml
        Hide
        Simon Kitching added a comment -

        This appears to be a bug in the xml parser bundled with Java, and not a Digester issue.
        Attached is a file that just uses raw SAX apis to dump the attribute data, and the same issue appears.

        I've had a brief look at the xml spec, and can't see anything that forbids ] inside an attribute. As far as I can see, only "<" and "&" are prohibited.
        http://www.w3.org/TR/2006/REC-xml11-20060816/

        So I suggest filing this bug with Sun.

        Regards, Simon

        Show
        Simon Kitching added a comment - This appears to be a bug in the xml parser bundled with Java, and not a Digester issue. Attached is a file that just uses raw SAX apis to dump the attribute data, and the same issue appears. I've had a brief look at the xml spec, and can't see anything that forbids ] inside an attribute. As far as I can see, only "<" and "&" are prohibited. http://www.w3.org/TR/2006/REC-xml11-20060816/ So I suggest filing this bug with Sun. Regards, Simon
        Hide
        Simon Kitching added a comment -

        Show that bad data is provided by a raw sax parser for this specific input file.

        Show
        Simon Kitching added a comment - Show that bad data is provided by a raw sax parser for this specific input file.
        Hide
        Simon Kitching added a comment -

        This does appear to be a bug, but it isn't a bug with Digester and there is nothing that Digester can do to work around it.

        Show
        Simon Kitching added a comment - This does appear to be a bug, but it isn't a bug with Digester and there is nothing that Digester can do to work around it.
        Hide
        Ken Tanaka added a comment - - edited

        Thanks for looking into this Simon. Should I be the one that files a bug
        with Sun? Or was that suggestion for someone else?

        -Ken

        Show
        Ken Tanaka added a comment - - edited Thanks for looking into this Simon. Should I be the one that files a bug with Sun? Or was that suggestion for someone else? -Ken
        Hide
        Simon Kitching added a comment -

        That was a suggestion for you, Ken. It's your discovery

        Feel free to attach the example I wrote.

        If you can't be bothered, then I might get around to filing this myself. Eventually. But it is no easier for me to raise a bug with Sun than it is for you.

        Show
        Simon Kitching added a comment - That was a suggestion for you, Ken. It's your discovery Feel free to attach the example I wrote. If you can't be bothered, then I might get around to filing this myself. Eventually. But it is no easier for me to raise a bug with Sun than it is for you.
        Hide
        Ken Tanaka added a comment - - edited

        Simon,

        I reported this as a bug to Sun on Monday, I don't know how long it takes for them to check the report out. They (IncidentDaemon@sun.com) said:

        Your report has been assigned an internal review ID of 1274504, which is NOT visible on the Sun Developer Network (SDN).

        I included the SAX parser test code you sent on to me. Hopefully they will work on this soon.

        -Ken

        Show
        Ken Tanaka added a comment - - edited Simon, I reported this as a bug to Sun on Monday, I don't know how long it takes for them to check the report out. They (IncidentDaemon@sun.com) said: Your report has been assigned an internal review ID of 1274504, which is NOT visible on the Sun Developer Network (SDN). I included the SAX parser test code you sent on to me. Hopefully they will work on this soon. -Ken

          People

          • Assignee:
            Unassigned
            Reporter:
            Ken Tanaka
          • Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development