  1. Hadoop HDFS
  2. HDFS-4420

Provide a way to exclude subtree from balancing process

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Minor
    • Resolution: Duplicate
    • Component/s: balancer & mover

    Description

      The balancer moves blocks without regard to where their files sit in the filesystem hierarchy. Sometimes it would be useful to exclude a subtree from the balancing process.

      For example, regionserver data locality is crucial for HBase performance. A region's data is tied to its regionserver, which runs on a specific machine in the cluster. As regionservers read and write region data, that data eventually ends up on the local machine, so all reads become local, which is great for performance. The balancer breaks this locality by moving blocks around.

      This patch adds an [-exclude <path>] switch. If a path is provided,
      the balancer will not move blocks under that path.
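
      The subtree test behind such a switch can be sketched as a path-prefix check. This is a minimal illustration, not code from the attached patch: the class SubtreeExclusion and its methods are hypothetical names, and it assumes the balancer can already map a block to the path of the file that owns it, which this description does not cover.

```java
import java.util.List;

// Hypothetical helper for illustration only; not taken from the attached patch.
final class SubtreeExclusion {

    // Strip trailing slashes so "/hbase/" and "/hbase" compare equal.
    static String normalize(String path) {
        String p = path;
        while (p.length() > 1 && p.endsWith("/")) {
            p = p.substring(0, p.length() - 1);
        }
        return p;
    }

    // True if filePath is the excluded root itself or lies beneath it.
    // Appending "/" before the prefix test keeps "/hbase2" from matching "/hbase".
    static boolean isUnder(String filePath, String excludedRoot) {
        String file = normalize(filePath);
        String root = normalize(excludedRoot);
        if (root.equals("/")) {
            return true; // excluding the root excludes everything
        }
        return file.equals(root) || file.startsWith(root + "/");
    }

    // True if the file falls under any of the excluded subtrees.
    static boolean isExcluded(String filePath, List<String> excludedRoots) {
        for (String root : excludedRoots) {
            if (isUnder(filePath, root)) {
                return true;
            }
        }
        return false;
    }
}
```

      Under these assumptions, a block would be skipped whenever isExcluded returns true for the path of its owning file. Comparing on a path-component boundary, not on a raw string prefix, is what prevents sibling directories with a shared name prefix from being excluded by accident.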

      The attached patch has been tested against 0.90.2.

      Attachments

        1. HDFS-4420-v4.patch
          14 kB
          Max Lapan
        2. Balancer-exclude-trunk-v3.patch
          13 kB
          Max Lapan
        3. Balancer-exclude-trunk-v2.patch
          12 kB
          Max Lapan
        4. Balancer-exclude-trunk.patch
          8 kB
          Max Lapan
        5. Balancer-exclude-subtree-0.90.2.patch
          8 kB
          Max Lapan

            People

              Assignee: Unassigned
              Reporter: Max Lapan (shmuma)
              Votes: 1
              Watchers: 28
