  1. Hadoop HDFS
  2. HDFS-5177

blocksScheduled count should be decremented for abandoned blocks

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.1.0-beta, 3.0.0-alpha1
    • Fix Version/s: 2.8.0, 3.0.0-alpha1
    • Component/s: namenode
    • Labels: None
    • Hadoop Flags: Reviewed

    Description

      DatanodeDescriptor#incBlocksScheduled() is called for every datanode chosen for a block on each allocation, but the count is not decremented when the block is abandoned.

      When one of the datanodes is down and it is allocated for a block along with other live datanodes, the block will be abandoned. The scheduled count on the live datanodes stays incremented, so they are treated as loaded even though in reality they may not be.

      The scheduled count is rolled every 20 minutes anyway, so the inflated value is eventually cleared.

      The problem shows up when files are created at a high rate. Because of the inflated scheduled count, the local datanode may be skipped as a write target, and in small clusters writes can sometimes fail.

      So the unnecessary count needs to be decremented when a block is abandoned.
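
      The bookkeeping described above can be illustrated with a minimal sketch. It is not the attached patch: the class ScheduledCountSketch, its nested Node class, and the methods decBlocksScheduled(), rollBlocksScheduled(), allocateBlock() and abandonBlock() are hypothetical stand-ins, and the two-interval counter is only an assumption about how a rolled count might be kept.

```java
import java.util.Arrays;
import java.util.List;

public class ScheduledCountSketch {

    /** Hypothetical stand-in for a per-datanode counter; not the real DatanodeDescriptor. */
    public static final class Node {
        private int currScheduled; // blocks scheduled in the current interval
        private int prevScheduled; // blocks scheduled in the previous interval

        public void incBlocksScheduled() {
            currScheduled++;
        }

        /** Undo one scheduled block without letting the count go negative. */
        public void decBlocksScheduled() {
            if (prevScheduled > 0) {
                prevScheduled--;
            } else if (currScheduled > 0) {
                currScheduled--;
            }
        }

        /** Periodic roll: the older interval is discarded. */
        public void rollBlocksScheduled() {
            prevScheduled = currScheduled;
            currScheduled = 0;
        }

        public int getBlocksScheduled() {
            return currScheduled + prevScheduled;
        }
    }

    /** Allocation increments the count on every chosen target. */
    public static void allocateBlock(List<Node> targets) {
        for (Node n : targets) {
            n.incBlocksScheduled();
        }
    }

    /** Abandoning the block gives the count back on every target. */
    public static void abandonBlock(List<Node> targets) {
        for (Node n : targets) {
            n.decBlocksScheduled();
        }
    }

    public static void main(String[] args) {
        Node live1 = new Node();
        Node live2 = new Node();
        Node down = new Node(); // unreachable node, so the client abandons the block
        List<Node> pipeline = Arrays.asList(live1, live2, down);

        allocateBlock(pipeline);
        System.out.println(live1.getBlocksScheduled()); // prints 1

        abandonBlock(pipeline);
        System.out.println(live1.getBlocksScheduled()); // prints 0
    }
}
```

      Without the abandonBlock() step in this sketch, live1 and live2 would keep a count of 1 until two rolls had passed, which is the inflated load the description refers to.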

      Attachments

        1. HDFS-5177.patch
          14 kB
          Vinayakumar B
        2. HDFS-5177.patch
          13 kB
          Vinayakumar B
        3. HDFS-5177.patch
          13 kB
          Vinayakumar B
        4. HDFS-5177-04.patch
          6 kB
          Vinayakumar B
        5. HDFS-5177-05.patch
          5 kB
          Vinayakumar B

          People

            Assignee: Vinayakumar B
            Reporter: Vinayakumar B
            Votes: 0
            Watchers: 5
