Details
-
Bug
-
Status: Closed
-
Major
-
Resolution: Duplicate
-
1.11
-
None
-
None
Description
If for example one kills an inject or generate job, Nutch does not clean up 'temporary' directories and I have witnessed them remain within HDFS. This is far from ideal if we have a large team of users all hammering away on Yarn and persisting data into HDFS.
We should investigate how to clean up these directories such that a cluster admin is not left with all of the dross at the end of the long day
Attachments
Issue Links
- is superceded by
-
NUTCH-2518 Must check return value of job.waitForCompletion()
- Closed