Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-3009

XML Parser reset() detection no working in weblogic 12.2.1.3

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Critical
    • Resolution: Fixed
    • 1.20, 1.21, 1.22, 1.23
    • 1.25
    • core
    • None
    • JDK 1.8.0_231

      Oracle Weblogic Server 12.2.1.3

    Description

      Starting with tika 1.20 the org.apache.tika.utils.XMLReaderUtils try to detect if a XML parser supports the reset() functionality by calling reset() during the poolParser creation and watching for a UnsupportedOperationException.

      This unfortunately does not work in weblogic server as the attained RegistryParser itself caches underlying SAX parsers. Only after first use the reset() of the underlying SAXParser is called and will produce the UnsupportedOperationException. A first call to reset() will not produce this exception and XMLReaderUtils thinks, the parser supports reset() which in effect is not true.

      This results in exhaustion of the parser pool and intermittent errors and delays in processing as the pool is reset when a parser is not available after 5 minutes.

      Attachments

        Activity

          People

            tallison Tim Allison
            dani2305 Daniel
            Votes:
            2 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: