Details
- New Feature
- Status: Resolved
- Major
- Resolution: Fixed
- None
- None
- Reviewed
Description
Filing this issue in response to "full disk woes" on hdfs-user.
Datanodes fill their storage directories unevenly, leading to situations where certain disks are full while others are significantly less used. Users at many different sites have experienced this issue, and HDFS administrators are taking steps like:
- Manually rebalancing blocks in storage directories
- Decommissioning nodes & later re-adding them
There's a tradeoff between making use of all available spindles and filling disks at roughly the same rate. Possible solutions include:
- Weighting less-used disks more heavily when placing new blocks on the datanode. In write-heavy environments this will still make use of all spindles, equalizing disk use over time.
- Rebalancing blocks locally. This would help equalize disk use as disks are added/replaced in older cluster nodes.
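The first option above could look roughly like the sketch below. It is only an illustration, not the HDFS implementation: the function name `choose_volume`, its parameters, and the choice of weighting each volume in direct proportion to its free bytes are all assumptions made for the example.

```python
import random


def choose_volume(free_bytes, block_size, rng=random):
    """Pick a volume index with probability proportional to its free space.

    free_bytes: free bytes per volume (hypothetical input, one entry per disk).
    block_size: size in bytes of the block about to be written.
    rng: the random module or a random.Random instance (for repeatable tests).
    """
    # Only volumes that can actually hold the block are candidates.
    candidates = [
        (index, free)
        for index, free in enumerate(free_bytes)
        if free > 0 and free >= block_size
    ]
    if not candidates:
        raise OSError("no volume has room for the block")

    indices = [index for index, _ in candidates]
    weights = [free for _, free in candidates]
    # Emptier disks get more new blocks, but every eligible disk still
    # receives some writes, so all spindles stay in use.
    return rng.choices(indices, weights=weights, k=1)[0]
```

Because selection is random rather than "always pick the emptiest disk", a newly added empty disk does not absorb every write, which is the tradeoff between spindle use and even fill described above.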
Datanodes should actively manage their local disk so operator intervention is not needed.
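Local rebalancing, the second option in the list above, can be sketched as a planning step that decides how many bytes each over-full disk should hand to each under-full disk so that all disks reach the same utilization ratio. This is a simplified illustration under stated assumptions: `plan_moves`, its parameters, and the greedy pairing are hypothetical, and it plans in bytes, whereas a real datanode would have to move whole blocks.

```python
def plan_moves(capacity, used, tolerance=1.0):
    """Plan byte transfers that bring every volume to the node-wide utilization.

    capacity, used: bytes per volume (hypothetical inputs, same length).
    tolerance: imbalances of at most this many bytes are ignored.
    Returns a list of (source_index, destination_index, bytes_to_move).
    """
    total_capacity = sum(capacity)
    if total_capacity == 0:
        return []

    # Utilization every volume would have if data were spread evenly.
    ideal_ratio = sum(used) / total_capacity
    # Positive delta: the volume holds too much; negative: it has room.
    delta = [u - ideal_ratio * c for u, c in zip(used, capacity)]

    donors = [[d, i] for i, d in enumerate(delta) if d > tolerance]
    receivers = [[-d, i] for i, d in enumerate(delta) if d < -tolerance]

    moves = []
    while donors and receivers:
        surplus, src = donors[-1]
        deficit, dst = receivers[-1]
        amount = min(surplus, deficit)
        moves.append((src, dst, int(amount)))
        donors[-1][0] -= amount
        receivers[-1][0] -= amount
        # At least one side is now satisfied, so the loop always terminates.
        if donors[-1][0] <= tolerance:
            donors.pop()
        if receivers[-1][0] <= tolerance:
            receivers.pop()
    return moves
```

For example, `plan_moves([100, 100], [90, 10])` yields a single move of 40 bytes from volume 0 to volume 1, leaving both at 50% utilization.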
Attachments
Issue Links
- is duplicated by
  - HDFS-4408 Balancer for DataNode's Volumes (Resolved)
- is related to
  - HDFS-10576 DiskBalancer followup work items (Resolved)
  - HDFS-1804 Add a new block-volume device choosing policy that looks at free space (Closed)
  - HDFS-1121 Allow HDFS client to measure distribution of blocks across devices for a specific DataNode (Open)
  - HDFS-1120 Make DataNode's block-to-device placement policy pluggable (Resolved)
- relates to
  - HDFS-8538 Change the default volume choosing policy to AvailableSpaceVolumeChoosingPolicy (Patch Available)