Tika / TIKA-1786

Downgrade logging severity in FileResourceConsumer and fix handling of illegal xml characters

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Trivial
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.12
    • Component/s: tika-batch
    • Labels: None

    Description

      FileResourceConsumer logs an xmlified snippet to record problems encountered during parsing. If a parser includes illegal XML characters in the ParseException, the xmlification code catches the resulting exception and logs it as an error.

      The xmlification code should be robust against illegal characters, and we should downgrade the logging severity from error to warning when no actual error was thrown by a parser.
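
      One way to make xmlification robust against illegal characters is to filter the message before it is written as XML. The sketch below is not the committed fix; the class and method names (XmlCharSanitizer, stripIllegalXmlChars) are hypothetical, and replacing each illegal character with a space is an assumed policy. It keeps only code points allowed by the XML 1.0 Char production.

```java
// Hypothetical helper, not part of tika-batch: illustrates filtering a
// message down to characters that XML 1.0 permits.
final class XmlCharSanitizer {

    private XmlCharSanitizer() {}

    // True if the code point matches the XML 1.0 Char production:
    // #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF]
    static boolean isLegalXmlChar(int cp) {
        return cp == 0x9 || cp == 0xA || cp == 0xD
                || (cp >= 0x20 && cp <= 0xD7FF)
                || (cp >= 0xE000 && cp <= 0xFFFD)
                || (cp >= 0x10000 && cp <= 0x10FFFF);
    }

    // Replaces every illegal code point (including unpaired surrogates)
    // with a single space; a null input yields an empty string.
    static String stripIllegalXmlChars(String s) {
        if (s == null) {
            return "";
        }
        StringBuilder sb = new StringBuilder(s.length());
        for (int i = 0; i < s.length(); ) {
            int cp = s.codePointAt(i);
            sb.appendCodePoint(isLegalXmlChar(cp) ? cp : ' ');
            i += Character.charCount(cp);
        }
        return sb.toString();
    }
}
```

      With a filter like this applied to the exception message, a control character in a ParseException would no longer cause the xmlification step itself to fail, so the snippet could be logged at warning level as proposed above.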

          People

            Assignee: Tim Allison (tallison)
            Reporter: Tim Allison (tallison)