  1. Hadoop HDFS
  2. HDFS-14894

Add balancer parameter to balance top used nodes


Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Done
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: balancer & mover
    • Labels: None

    Description

      We sometimes see a few of our datanodes reach very high usage (for various reasons), and we need to reduce their usage urgently.

      We currently see two ways to achieve this:

      - Calculate and reset the balancing threshold.

      - Pick nodes manually according to usage stats, put them in a file, and use the `-source` flag.
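
To illustrate the first workaround, here is a minimal sketch of the arithmetic involved. It assumes the balancer treats a datanode as over-utilized when its usage exceeds the cluster average by more than the threshold, and it approximates the cluster average as the plain mean of per-node usage (exact only when all nodes have equal capacity). `ThresholdWorkaround` and `thresholdFor` are hypothetical names, not part of Hadoop.

```java
import java.util.List;

// Hypothetical helper, not part of Hadoop: derives a balancer threshold
// from per-node usage so that only nodes above a target usage are treated
// as over-utilized.
final class ThresholdWorkaround {

    // Returns the threshold, in percentage points, such that
    // (average usage + threshold) equals targetUsagePercent.
    // Assumption: the cluster average is the plain mean of node usage.
    static double thresholdFor(List<Double> nodeUsagePercents, double targetUsagePercent) {
        double average = nodeUsagePercents.stream()
                .mapToDouble(Double::doubleValue)
                .average()
                .orElseThrow(() -> new IllegalArgumentException("no datanodes given"));
        double threshold = targetUsagePercent - average;
        if (threshold <= 0) {
            // The cluster average is already at or above the target,
            // so no positive threshold can isolate the hot nodes.
            throw new IllegalArgumentException("target usage must exceed the cluster average");
        }
        return threshold;
    }
}
```

With node usage of 60%, 70%, 98%, and 52%, the mean is 70%, so targeting nodes above 95% would mean running the balancer with a threshold of 25. The value has to be recomputed whenever cluster usage shifts, which is part of what makes this workaround awkward.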

      However, neither is very intuitive, and both involve too much manual work in an urgent, close-to-outage situation. Adding a small feature that automatically picks the most-used hosts would be a more straightforward option, for example `-sourceThreshold 95` to target only datanodes with >95% usage.
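
The selection this proposal describes can be sketched as a simple filter over per-node usage. This is only an illustration under stated assumptions, not the code in the attached patch: `SourceThresholdSketch`, `pickSources`, and the host-to-usage map are hypothetical, and a real implementation would read usage from the balancer's datanode reports.

```java
import java.util.List;
import java.util.Map;
import java.util.stream.Collectors;

// Hypothetical sketch, not the patch's implementation: choose balancer
// source nodes by comparing each datanode's usage to a threshold.
final class SourceThresholdSketch {

    // Returns the hosts whose usage is strictly greater than
    // sourceThreshold (a percentage), most-used first.
    static List<String> pickSources(Map<String, Double> usagePercentByHost,
                                    double sourceThreshold) {
        return usagePercentByHost.entrySet().stream()
                .filter(e -> e.getValue() > sourceThreshold)
                .sorted(Map.Entry.<String, Double>comparingByValue().reversed())
                .map(Map.Entry::getKey)
                .collect(Collectors.toList());
    }
}
```

For example, given `dn1` at 97.5%, `dn2` at 80.0%, `dn3` at 99.1%, and `dn4` at exactly 95.0%, a threshold of 95 selects `dn3` and `dn1`. The comparison is strict, matching the ">95% usage" wording, so `dn4` is left out. This automates the same host list that the manual workaround builds by hand.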

      Attachments

        1. HDFS-14894.001.patch
          9 kB
          Leon Gao


            People

              Assignee: Leon Gao (LeonG)
              Reporter: Leon Gao (LeonG)
              Votes: 0
              Watchers: 7
