Uploaded image for project: 'XalanJ2'
  1. XalanJ2
  2. XALANJ-2398

parsing big XMLs take very long time. A JAXRPC webservice request of soap size 2MB takes above 5 mins to complete parsing and start processing web service.

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Reopened
    • Major
    • Resolution: Unresolved
    • None
    • None
    • transformation
    • None
    • Operating System: All
      Platform: All

    Description

      parsing bix XMLs take very long time. A JAXRPC webservice request of soap size 2MB takes above 5 mins to complete parsing and start processing web service.
      Observed that the SOAP messages are read line by line and CharacterDataImpl.append() is getting called for each line. This appends each line to the string data which stores the previously read xml part. This call results in many string additions (say 20000+ for 1MB SOAP) of big string and the thread is most of the time blocked in StringBuilder.expandcapacity due to long string additions.

      JAX RPC webservice with SOAP messages bigger than 1MB take 5 mins and more just for the web service implementation to start working as the jaxrpc does a SOAPMessage.getenvelop call to do HandlerChainImpl.checkMustUnderstand() checking for the request message header. Same problem happens for response also. for time being we commented checkMustUnderstand method so that the parsing doesn't happen at all.

      String additions for each of the lines would have been avoided, either by creating the whole data single time or using string buffer instead of string. I tried to modify the data field to use StringBuffer instead of String but the underlying CoreDocumentImpl.modifiedCharacterData() and all underlying calls takes the string params so couldn't go ahead with it.

      bug # XERCESJ-102 looks like the same issue..

      Attachments

        1. JaxRPCProcessRequest.java
          4 kB
          SHEREEF ABDULLA
        2. WrkingBigSOAP.txt
          849 kB
          SHEREEF ABDULLA

        Activity

          People

            Unassigned Unassigned
            sabdulla SHEREEF ABDULLA
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated: