Hadoop HDFS / HDFS-5947

Improve dead node detection and handling

Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 0.23.0, 2.0.0-alpha, 3.0.0-alpha1
    • Fix Version/s: None
    • Component/s: namenode
    • Labels: None

    Description

      When HeartbeatManager.heartbeatCheck runs, it:

      1. Scans all DNs to count dead nodes
      2. Processes the first dead node found
      3. If a dead node was found, loops back and re-scans all DNs

      Processing the dead node holds the namesystem write lock while removing the node from the blockmap. It also appears to do a lot of work to immediately re-adjust the replication queues. All of this processing may be too expensive to perform while holding the write lock, e.g., if a rack or two is lost.
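
      The scan/process/re-scan loop described above can be modelled roughly as follows. This is a simplified sketch and not the actual HeartbeatManager code: the class HeartbeatCheckSketch, the Node and Stats types, and the counters are all hypothetical names introduced for illustration, and the write lock is represented only by a counter.

```java
import java.util.ArrayList;
import java.util.List;

/** Simplified model of the loop described in this issue; not Hadoop code. */
class HeartbeatCheckSketch {

    /** Hypothetical stand-in for a datanode descriptor. */
    static final class Node {
        final String name;
        final boolean dead;
        Node(String name, boolean dead) { this.name = name; this.dead = dead; }
    }

    /** Counters collected while simulating one heartbeat check. */
    static final class Stats {
        int fullScans;              // complete passes over the DN list
        int nodesVisited;           // total per-node liveness checks
        int writeLockAcquisitions;  // one per dead node removed
    }

    static Stats heartbeatCheck(List<Node> datanodes) {
        Stats stats = new Stats();
        List<Node> registered = new ArrayList<>(datanodes);
        boolean allAlive = false;
        while (!allAlive) {
            // Step 1: scan every DN, remembering only the first dead one.
            Node firstDead = null;
            stats.fullScans++;
            for (Node n : registered) {
                stats.nodesVisited++;
                if (n.dead && firstDead == null) {
                    firstDead = n;
                }
            }
            allAlive = (firstDead == null);
            if (!allAlive) {
                // Step 2: process that one node. In the real system this is
                // where the namesystem write lock would be held.
                stats.writeLockAcquisitions++;
                registered.remove(firstDead);
                // Step 3: loop back and re-scan everything.
            }
        }
        return stats;
    }

    public static void main(String[] args) {
        List<Node> nodes = new ArrayList<>();
        for (int i = 0; i < 10; i++) {
            nodes.add(new Node("dn" + i, i % 4 == 0)); // dn0, dn4, dn8 are dead
        }
        Stats s = heartbeatCheck(nodes);
        System.out.println("scans=" + s.fullScans
                + " visits=" + s.nodesVisited
                + " lockAcquisitions=" + s.writeLockAcquisitions);
    }
}
```

      In this model, D dead nodes out of N cost D + 1 full scans and D separate lock acquisitions, so the scanning work grows roughly as N × D. The model does not capture how long each lock hold lasts, which is the other cost this issue raises.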

          People

            Assignee: Unassigned
            Reporter: Daryn Sharp (daryn)
            Votes: 0
            Watchers: 9
