Details
-
Bug
-
Status: Open
-
Minor
-
Resolution: Unresolved
-
None
-
None
-
None
-
New, Patch Available
Description
ExtractReuters in the benchmark module currently fails because it currently creates the output directory, and then calls IOUtils.rm on it (which will remove all files in it as well as removes the output directory itself). This is to fix this behaviour.
While I was at it, I also added a bit more logging in case of file errors (the download I had some bad data) and made the task recover in case of issues with one file.