Hadoop YARN / YARN-1071

ResourceManager's decommissioned and lost node count is 0 after restart

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.1.0-beta
    • Fix Version/s: 2.4.0
    • Component/s: resourcemanager
    • Labels:
      None
    • Hadoop Flags:
      Reviewed

      Description

      I had a 6-node cluster with 2 NMs stopped. I then added a host to the exclude file referenced by yarn.resourcemanager.nodes.exclude-path. After running yarn rmadmin -refreshNodes, the RM's JMX correctly showed the decommissioned node count:

      "NumActiveNMs" : 3,
      "NumDecommissionedNMs" : 1,
      "NumLostNMs" : 2,
      "NumUnhealthyNMs" : 0,
      "NumRebootedNMs" : 0
      

      After restarting the RM, JMX showed the following counts:

      "NumActiveNMs" : 3,
      "NumDecommissionedNMs" : 0,
      "NumLostNMs" : 0,
      "NumUnhealthyNMs" : 0,
      "NumRebootedNMs" : 0
      

      Notice that the lost and decommissioned NM counts are both 0, although they were 2 and 1 before the restart.
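      The comparison between the two JMX snapshots above can be automated. The following is a minimal sketch, not part of the fix: it assumes the JMX output is a JSON document with a top-level "beans" list (the shape Hadoop's /jmx servlet is expected to return), and the helper names node_counts and changed_counts are hypothetical.

```python
import json

# Metric names taken from the JMX output quoted in this issue.
NODE_COUNT_KEYS = (
    "NumActiveNMs",
    "NumDecommissionedNMs",
    "NumLostNMs",
    "NumUnhealthyNMs",
    "NumRebootedNMs",
)


def node_counts(jmx_json):
    """Return the NM counts from a JMX JSON dump.

    Assumption: the dump looks like {"beans": [{...}, ...]} and exactly
    one bean carries all of the NODE_COUNT_KEYS metrics.
    """
    beans = json.loads(jmx_json).get("beans", [])
    for bean in beans:
        if all(key in bean for key in NODE_COUNT_KEYS):
            return {key: bean[key] for key in NODE_COUNT_KEYS}
    raise ValueError("no bean with NodeManager counts found")


def changed_counts(before, after):
    """Map each metric whose value differs to a (before, after) pair."""
    return {
        key: (before[key], after[key])
        for key in NODE_COUNT_KEYS
        if before[key] != after[key]
    }
```

      Given one snapshot taken before the RM restart and one taken after, changed_counts reports only the metrics that differ, which for this issue would be the decommissioned and lost counts.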

        Attachments

        1. YARN-1071.1.patch
          10 kB
          Jian He
        2. YARN-1071.2.patch
          12 kB
          Jian He
        3. YARN-1071.3.patch
          12 kB
          Jian He
        4. YARN-1071.4.patch
          12 kB
          Jian He
        5. YARN-1071.5.patch
          12 kB
          Jian He
        6. YARN-1071.6.patch
          13 kB
          Jian He

              People

              • Assignee:
                Jian He (jianhe)
              • Reporter:
                Srimanth Gunturi (srimanth.gunturi)
              • Votes:
                0
              • Watchers:
                9
