Details
-
Improvement
-
Status: Closed
-
Major
-
Resolution: Fixed
-
None
-
None
-
Reviewed
-
Datanode can continue if a volume for replica storage fails. Previously a datanode resigned if any volume failed.
Description
Current implementation shuts DataNode down completely when one of the configured volumes of the storage fails.
This is rather wasteful behavior because it decreases utilization (good storage becomes unavailable) and imposes extra load on the system (replication of the blocks from the good volumes). These problems will become even more prominent when we move to mixed (heterogeneous) clusters with many more volumes per Data Node.
Attachments
Attachments
Issue Links
- is related to
-
HDFS-138 data node process should not die if one dir goes bad
- Resolved