Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
-
None
-
None
-
Reviewed
Description
DiskChecker can fail to detect total disk/controller failures indefinitely. We have seen this in real clusters. DiskChecker performs simple permissions-based checks on directories which do not guarantee that any disk IO will be attempted.
A simple improvement is to write some data and flush it to the disk.
Attachments
Attachments
Issue Links
- breaks
-
HADOOP-15450 Avoid fsync storm triggered by DiskChecker and handle disk full situation
- Resolved
- is blocked by
-
HADOOP-13737 Cleanup DiskChecker interface
- Resolved