Details
-
Task
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
-
None
-
None
Description
I think it would be useful to add a new module for fuzzing. We should eventually integrate grownup/mature fuzzers (jqf), but we can start with cheap 'n easy mangling.
The reason to add this to tika and not have it as a separate project, is that I think there will eventually be file format specific fuzzers for file formats that rely on compressed streams, e.g. PDF, docx, where mangling a literal byte in the file is a good first step, but it might also be useful to mangle bytes within the streams or implement faulty/malicious compressors or inject malfeasance at other levels.
I'm not set on adding this to Tika. If there are objections/recommendations, please share.