TIKA-2430: Add at least dev test capability to run Tika against fuzzed files

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.17
    • Component/s: None
    • Labels: None

    Description

      lfcnassif observed on TIKA-2428 that a corrupt file caused the EMFParser to hang permanently. Files can become corrupted for various reasons. We can add some optional code that lets people experiment with running Tika against randomly corrupted versions of the files in our test suite. I suspect this will unearth too many errors, at least initially, for it to be run on a regular basis.

      Let's at least add some code in tika-parsers to let devs run the tests.
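
A rough sketch of what such a dev-only harness could look like, using only the JDK. This is not the code that was committed for this issue. The names FuzzSketch, ByteParser, Outcome, corrupt and runWithTimeout are all hypothetical, and the corruption strategy (overwriting a random fraction of bytes) is just one simple assumption about what "randomly corrupted" might mean. The timeout matters because of hangs like the one reported in TIKA-2428.

```java
import java.util.Random;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

class FuzzSketch {

    /** Hypothetical stand-in for a parser call, e.g. a thin wrapper around a Tika parser. */
    interface ByteParser {
        void parse(byte[] data) throws Exception;
    }

    enum Outcome { OK, EXCEPTION, TIMEOUT }

    /** Returns a copy of the input in which roughly `rate` of the bytes are overwritten with random values. */
    static byte[] corrupt(byte[] original, double rate, Random random) {
        byte[] fuzzed = original.clone();
        for (int i = 0; i < fuzzed.length; i++) {
            if (random.nextDouble() < rate) {
                fuzzed[i] = (byte) random.nextInt(256);
            }
        }
        return fuzzed;
    }

    /** Runs the parser on a separate thread and reports whether it finished, threw, or exceeded the timeout. */
    static Outcome runWithTimeout(ByteParser parser, byte[] data, long timeoutMillis)
            throws InterruptedException {
        ExecutorService executor = Executors.newSingleThreadExecutor(r -> {
            Thread t = new Thread(r, "fuzz-parse");
            t.setDaemon(true); // a hung parse must not keep the JVM alive
            return t;
        });
        Future<?> future = executor.submit(() -> {
            parser.parse(data);
            return null;
        });
        try {
            future.get(timeoutMillis, TimeUnit.MILLISECONDS);
            return Outcome.OK;
        } catch (TimeoutException e) {
            future.cancel(true); // best effort; a truly stuck thread may ignore the interrupt
            return Outcome.TIMEOUT;
        } catch (ExecutionException e) {
            return Outcome.EXCEPTION; // the parser threw; e.getCause() holds the original error
        } finally {
            executor.shutdownNow();
        }
    }
}
```

In tika-parsers, the ByteParser would presumably wrap a real parser call over a ByteArrayInputStream of the fuzzed bytes. Seeding the Random and logging the seed would make any hang or crash reproducible.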

          People

            Assignee: Tim Allison (tallison)
            Reporter: Tim Allison (tallison)