Details
-
Sub-task
-
Status: Closed
-
Major
-
Resolution: Fixed
-
None
-
None
-
Reviewed
Description
MAPREDUCE-3121 makes NodeManager identify disk failures. But once a disk goes down, it is marked as failed forever. To reuse that disk (after it becomes good), NodeManager needs restart. This JIRA is to improve NodeManager to reuse good disks(which could be bad some time back).
Attachments
Attachments
Issue Links
- breaks
-
YARN-4393 TestResourceLocalizationService#testFailedDirsResourceRelease fails intermittently
- Closed
- duplicates
-
YARN-1196 LocalDirsHandlerService never change failedDirs back to normal even when these disks turn good
- Resolved
-
YARN-1380 Enable NM to automatically reuse failed local dirs after they are available again
- Resolved
- is duplicated by
-
YARN-2473 YARN never cleans up container directories from a full disk
- Resolved
-
YARN-4011 Jobs fail since nm-local-dir not cleaned up when rogue job fills up disk
- Resolved
-
YARN-2488 Nodemanager should support check if bad disks have turned to good ones
- Resolved
- is related to
-
YARN-2839 YARN minicluster doesn't bail out if all the NM disks are dead
- Open
-
YARN-2799 cleanup TestLogAggregationService based on the change in YARN-90
- Closed