Xerces2-J / XERCESJ-977

Null pointer exception during DOM parsing

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.6.2
    • Fix Version/s: 2.9.1
    • Component/s: DOM (Level 3 Core)
    • Labels: None

    Description

      We are parsing large numbers of XML files with DOM and very occasionally get a NullPointerException during parsing. In this case we tracked the problem down to a point in the text where a tag with a quoted attribute value appears inside quoted text:
      [a]nimals should be housed in facilities dedicated to or assigned for that purpose...<bibr rid="b2"/>”

      Any of the following changes to the document would get rid of the null pointer exception and allow parsing:
      1) Changing the bibr tag to a different tag without any attributes.
      2) Removing the outside quotes.
      3) Moving the bibr tag to outside the quotes.
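
A minimal sketch of a harness for checking whether a given document parses, so that each of the three changes above can be tested in turn. The class name `DomParseCheck`, its helper methods, and the inline `SAMPLE` document are hypothetical and not part of the report. The sample only mirrors the quoted sentence; it is not the attached x6.xml and is not expected to trigger the exception on its own.

```java
import java.io.File;
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.xml.sax.InputSource;

class DomParseCheck {
    // Hypothetical stand-in for the failing passage: a tag with a quoted
    // attribute value inside text wrapped in curly quotes.
    static final String SAMPLE =
        "<p>\u201C[a]nimals should be housed in facilities dedicated to or "
        + "assigned for that purpose...<bibr rid=\"b2\"/>\u201D</p>";

    // Parses the source with a default JAXP DOM parser.
    // Returns null on success, otherwise the exception that was thrown
    // (including unchecked ones such as NullPointerException).
    private static Throwable tryParse(InputSource source) {
        try {
            DocumentBuilder builder =
                DocumentBuilderFactory.newInstance().newDocumentBuilder();
            builder.parse(source);
            return null;
        } catch (Exception e) {
            return e;
        }
    }

    // Parse from a file; the system ID lets a relative DTD reference resolve.
    static Throwable tryParse(File file) {
        return tryParse(new InputSource(file.toURI().toString()));
    }

    // Parse from an in-memory string.
    static Throwable tryParse(String xml) {
        return tryParse(new InputSource(new StringReader(xml)));
    }

    public static void main(String[] args) {
        Throwable failure = args.length > 0
            ? tryParse(new File(args[0]))
            : tryParse(SAMPLE);
        if (failure == null) {
            System.out.println("parsed OK");
        } else {
            failure.printStackTrace();
        }
    }
}
```

Running it against the original file and then against a copy with one of the changes applied shows which edit makes the failure disappear.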

      Here is the stack trace for the error:
      522316528 [Thread-200] ERROR -> org.apache.xerces.dom.DeferredDocumentImpl.setChunkIndex(Unknown Source)
      522316529 [Thread-200] ERROR -> org.apache.xerces.dom.DeferredDocumentImpl.insertBefore(Unknown Source)
      522316529 [Thread-200] ERROR -> org.apache.xerces.parsers.AbstractDOMParser.endGeneralEntity(Unknown Source)
      522316529 [Thread-200] ERROR -> org.apache.xerces.impl.dtd.XMLDTDValidator.endGeneralEntity(Unknown Source)
      522316529 [Thread-200] ERROR -> org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.endEntity(Unknown Source)
      522316530 [Thread-200] ERROR -> org.apache.xerces.impl.XMLDocumentScannerImpl.endEntity(Unknown Source)
      522316530 [Thread-200] ERROR -> org.apache.xerces.impl.XMLEntityManager.endEntity(Unknown Source)
      522316530 [Thread-200] ERROR -> org.apache.xerces.impl.XMLEntityScanner.load(Unknown Source)
      522316530 [Thread-200] ERROR -> org.apache.xerces.impl.XMLEntityScanner.scanContent(Unknown Source)
      522316530 [Thread-200] ERROR -> org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanContent(Unknown Source)
      522316530 [Thread-200] ERROR -> org.apache.xerces.impl.XMLDocumentFragmentScannerImpl$FragmentContentDispatcher.dispatch(Unknown Source)
      522316530 [Thread-200] ERROR -> org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanDocument(Unknown Source)
      522316531 [Thread-200] ERROR -> org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source)
      522316531 [Thread-200] ERROR -> org.apache.xerces.parsers.DTDConfiguration.parse(Unknown Source)
      522316531 [Thread-200] ERROR -> org.apache.xerces.parsers.XMLParser.parse(Unknown Source)
      522316531 [Thread-200] ERROR -> org.apache.xerces.parsers.DOMParser.parse(Unknown Source)
      522316531 [Thread-200] ERROR -> org.apache.xerces.jaxp.DocumentBuilderImpl.parse(Unknown Source)
      522316531 [Thread-200] ERROR -> javax.xml.parsers.DocumentBuilder.parse(Unknown Source)
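
The top frames of the trace are in `DeferredDocumentImpl`, the document class Xerces uses when deferred node expansion is enabled. The sketch below shows how that feature can be switched off through JAXP so the parser builds a fully expanded DOM instead. The report does not say whether this avoids the exception in 2.6.2, so treat it as something to try rather than a confirmed workaround. The class name `NonDeferredDom` and its helpers are hypothetical.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;
import org.w3c.dom.Document;
import org.xml.sax.InputSource;

class NonDeferredDom {
    // Xerces feature controlling lazy (deferred) construction of DOM nodes.
    static final String DEFER_NODE_EXPANSION =
        "http://apache.org/xml/features/dom/defer-node-expansion";

    // Builds a DocumentBuilder with deferred node expansion disabled.
    // Throws ParserConfigurationException if the underlying parser
    // does not recognize the feature.
    static DocumentBuilder newBuilder() throws ParserConfigurationException {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        factory.setFeature(DEFER_NODE_EXPANSION, false);
        return factory.newDocumentBuilder();
    }

    static Document parse(String xml) throws Exception {
        return newBuilder().parse(new InputSource(new StringReader(xml)));
    }

    public static void main(String[] args) throws Exception {
        Document doc = parse("<p>text<bibr rid=\"b2\"/></p>");
        // Prints the concrete document class chosen by the parser.
        System.out.println(doc.getClass().getName());
    }
}
```

A fully expanded DOM uses more memory up front, which may matter when parsing large numbers of files.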

      Attachments

        1. x6.xml
          386 kB
          Ed Tyrrill
        2. event_catalog.dtd
          3 kB
          Ed Tyrrill

            People

              Assignee: Michael Glavassevich (mrglavas@ca.ibm.com)
              Reporter: Emily Horton (efhorton)
              Votes: 2
              Watchers: 2
