  1. Hadoop HDFS
  2. HDFS-3065

HA: Newly active NameNode does not recognize decommissioning DataNode


Details

    • Type: Bug
    • Status: Open
    • Priority: Minor
    • Resolution: Unresolved
    • Affects Version/s: HA branch (HDFS-1623)
    • Fix Version/s: None
    • Component/s: ha
    • Labels: None

    Description

      I'm working on a cluster where, originally, styx01 hosts the active NameNode and styx02 hosts the standby NameNode.

      I added the DataNode on styx03 to the exclude file on both styx01 and styx02. I then ran hdfs dfsadmin -refreshNodes and verified on the styx01 NN web UI that the DN on styx03 was decommissioning. After waiting a few minutes, I checked the standby NN web UI (while the DN was still decommissioning) and it did not show the DN as decommissioning.

      I executed a manual failover, making styx02 NN active and styx01 NN standby. I checked the newly active NN web UI, and the DN was still not marked as decommissioning, even after a few minutes. However, the newly standby NN's web UI still showed the DN as decommissioning.

      I added another DN to the exclude file and ran hdfs dfsadmin -refreshNodes again, but the styx02 NN web UI still did not show any decommissioning nodes.

      I then failed back over, making styx01 NN active and styx02 NN standby. I checked the styx01 NN web UI and saw that it correctly marked both DNs as decommissioning.
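
      For anyone trying to reproduce this, the sequence described above could be scripted roughly as follows. This is only a sketch under assumptions: the exclude-file path, the HA service IDs nn1 and nn2, and the second DataNode's hostname (styx04) are placeholders that do not come from the report. The script only prints the commands unless RUN=1 is set, and the web UI checks remain manual.

```shell
#!/usr/bin/env bash
# Dry-run sketch of the reproduction steps. Commands are only printed
# unless RUN=1 is set in the environment.
set -euo pipefail

# Assumptions (not from the report): exclude-file path and HA service IDs.
EXCLUDE_FILE="${EXCLUDE_FILE:-/etc/hadoop/conf/dfs.exclude}"
NN1="${NN1:-nn1}"                  # assumed service ID of the NameNode on styx01
NN2="${NN2:-nn2}"                  # assumed service ID of the NameNode on styx02
SECOND_DN="${SECOND_DN:-styx04}"   # hypothetical; the report does not name this DN

run() {
  # Print the command; execute it only when RUN=1.
  echo "+ $*"
  if [ "${RUN:-0}" = "1" ]; then
    "$@"
  fi
}

exclude_dn() {
  # Append a host to the exclude file (repeat on both NameNode hosts).
  run sh -c "echo '$1' >> '$EXCLUDE_FILE'"
}

repro() {
  # 1. Exclude the DataNode on styx03 and ask the NameNode to re-read the file.
  exclude_dn styx03
  run hdfs dfsadmin -refreshNodes
  # (Manual check: compare the styx01 and styx02 NN web UIs here.)

  # 2. Fail over so the NameNode on styx02 becomes active.
  run hdfs haadmin -failover "$NN1" "$NN2"
  # (Manual check: the styx02 NN web UI should list styx03 as decommissioning.)

  # 3. Exclude a second DataNode while styx02 is active.
  exclude_dn "$SECOND_DN"
  run hdfs dfsadmin -refreshNodes

  # 4. Fail back so the NameNode on styx01 is active again.
  run hdfs haadmin -failover "$NN2" "$NN1"
}

repro
```

      Running it without RUN=1 just lists the commands in order, which is a quick way to review the steps before touching a real cluster.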


          People

            Assignee: Unassigned
            Reporter: Stephen Chu (schu)
            Votes: 0
            Watchers: 8
