Details
- Sub-task
- Status: Resolved
- Major
- Resolution: Fixed
- None
- None
- Reviewed
Description
Balancer is inefficient in the following case:
Datanode | Utilization | Rack |
---|---|---|
D1 | 95% | A |
D2 | 30% | B |
D3, D4, D5 | 0% | B |
The average utilization is 25%, so D2 (30%) is within the 10% threshold and does not need balancing. However, Balancer currently moves blocks from D2 to D3, D4 and D5 first, because they are on the same rack. Only after that does it move blocks from D1, the one over-utilized datanode.
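The arithmetic behind this case can be checked with a minimal Python sketch. It is not Balancer code: the function `classify` and the bucket labels are hypothetical names chosen for illustration, and it assumes all datanodes have equal capacity, so that a plain mean of the utilizations gives the cluster average.

```python
from statistics import mean

# Utilization (percent) and rack of each datanode, taken from the table above.
DATANODES = {
    "D1": (95.0, "A"),
    "D2": (30.0, "B"),
    "D3": (0.0, "B"),
    "D4": (0.0, "B"),
    "D5": (0.0, "B"),
}


def classify(datanodes, threshold=10.0):
    """Bucket each datanode by its distance from the average utilization.

    Hypothetical helper for illustration only; not part of the Balancer.
    """
    # Assumption: equal capacity on every datanode, so a plain mean suffices.
    average = mean(u for u, _rack in datanodes.values())
    buckets = {}
    for name, (utilization, _rack) in datanodes.items():
        diff = utilization - average
        if diff > threshold:
            buckets[name] = "over-utilized"
        elif diff >= 0:
            buckets[name] = "above-average"  # within the threshold
        elif diff >= -threshold:
            buckets[name] = "below-average"  # within the threshold
        else:
            buckets[name] = "under-utilized"
    return average, buckets


average, buckets = classify(DATANODES)
print(f"average = {average:.0f}%")
for name, bucket in buckets.items():
    print(name, bucket)
```

Under these assumptions only D1 exceeds the average by more than the threshold, while D2 sits 5 points above it. Moving blocks off D2 first therefore spends balancing bandwidth on a node that is already within bounds.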
Attachments
Issue Links
- is depended upon by
  - HDFS-8923 Add -source flag to balancer usage message (Resolved)
- is duplicated by
  - HDFS-5958 One very large node in a cluster prevents balancer from balancing data (Resolved)
- is related to
  - HDFS-8921 Add an option to Balancer for specifying the k-most over-utilized DNs or all over-utilized DNs as sources. (Open)
  - HDFS-11008 Change unit test for testing parsing "-source" parameter in Balancer CLI (Resolved)