Hadoop HDFS / HDFS-14904

Add Option to let Balancer prefer highly utilized nodes in each iteration

Details

    • Reviewed

    Description

      The main purpose of the HDFS Balancer is usually to bring down usage on the most heavily used DataNodes, so that no DataNode's usage grows too high.

      Currently, the Balancer picks source nodes almost at random, regardless of their usage. This makes it slow to bring down the most heavily used DataNodes when the cluster has few underutilized nodes (consider a cluster expansion).

      We can add an option that prefers the most heavily used nodes as sources in each iteration, as suggested in HDFS-14894.
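
      The idea can be illustrated with a minimal sketch that orders candidate source nodes by utilization, highest first, before an iteration schedules them. This is not the Balancer's actual code: the class SourceOrderingSketch, the Node type, and the method preferHighlyUtilized are hypothetical names invented for illustration, and utilization is assumed to be a simple percentage.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

/** Hypothetical illustration only; not taken from the HDFS Balancer source. */
public class SourceOrderingSketch {

  /** Hypothetical stand-in for a datanode and its used capacity in percent. */
  public static final class Node {
    public final String name;
    public final double utilizationPercent;

    public Node(String name, double utilizationPercent) {
      this.name = name;
      this.utilizationPercent = utilizationPercent;
    }
  }

  /** Returns a copy of the candidates ordered from most to least utilized. */
  public static List<Node> preferHighlyUtilized(List<Node> candidates) {
    List<Node> ordered = new ArrayList<>(candidates);
    ordered.sort(
        Comparator.comparingDouble((Node n) -> n.utilizationPercent).reversed());
    return ordered;
  }

  public static void main(String[] args) {
    List<Node> candidates = new ArrayList<>();
    candidates.add(new Node("dn1", 72.0));
    candidates.add(new Node("dn2", 91.5));
    candidates.add(new Node("dn3", 85.0));

    // Print the candidates in the order a utilization-first policy would use.
    for (Node n : preferHighlyUtilized(candidates)) {
      System.out.println(n.name + " " + n.utilizationPercent);
    }
  }
}
```

      With this ordering, the fullest nodes are considered first in every iteration instead of depending on an arbitrary pick, which is the behavior the proposed option asks for.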

            People

              LeonG Leon Gao
              LeonG Leon Gao
              Votes: 0
              Watchers: 6

                Time Tracking

                  Estimated: Not Specified
                  Remaining: 0h
                  Logged: 1h 40m