Details
Description
The current management unit in Hadoop is a node, i.e. if a node failed, it will be kicked out and all the data on the node will be replicated.
As almost all SATA controller support hotplug, we add a new command line interface to datanode, thus it can list, add or remove a volume online, which means we can change a disk without node decommission. Moreover, if the failed disk still readable and the node has enouth space, it can migrate data on the disks to other disks in the same node.
A more detailed design document will be attached.
The original version in our lab is implemented against 0.20 datanode directly, and is it better to implemented it in contrib? Or any other suggestion?
Attachments
Attachments
Issue Links
- depends upon
-
HADOOP-7001 Allow configuration changes without restarting configured nodes
- Closed
- is blocked by
-
HDFS-1405 Make FSVolumeSet in FSDataSet pluggable
- Resolved
-
HADOOP-7001 Allow configuration changes without restarting configured nodes
- Closed
- is duplicated by
-
HDFS-664 Add a way to efficiently replace a disk in a live datanode
- Resolved
-
HDFS-4239 Means of telling the datanode to stop using a sick disk
- Resolved
- is related to
-
HDFS-2137 Datanode Disk Fail Inplace
- Resolved
- is required by
-
HDFS-4239 Means of telling the datanode to stop using a sick disk
- Resolved