Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-3896

RMNode transitioned from RUNNING to REBOOTED because its response id had not been reset synchronously

Log workAgile BoardRank to TopRank to BottomAttach filesAttach ScreenshotBulk Copy AttachmentsBulk Move AttachmentsVotersWatch issueWatchersCreate sub-taskConvert to sub-taskMoveLinkCloneLabelsUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Reviewed

    Description

      2015-07-03 16:49:39,075 INFO org.apache.hadoop.yarn.util.RackResolver: Resolved 10.208.132.153 to /default-rack
      2015-07-03 16:49:39,075 INFO org.apache.hadoop.yarn.server.resourcemanager.ResourceTrackerService: Reconnect from the node at: 10.208.132.153
      2015-07-03 16:49:39,075 INFO org.apache.hadoop.yarn.server.resourcemanager.ResourceTrackerService: NodeManager from node 10.208.132.153(cmPort: 8041 httpPort: 8080) registered with capability: <memory:6144, vCores:60, diskCapacity:213>, assigned nodeId 10.208.132.153:8041
      2015-07-03 16:49:39,104 INFO org.apache.hadoop.yarn.server.resourcemanager.ResourceTrackerService: Too far behind rm response id:2506413 nm response id:0
      2015-07-03 16:49:39,137 INFO org.apache.hadoop.yarn.server.resourcemanager.rmnode.RMNodeImpl: Deactivating Node 10.208.132.153:8041 as it is now REBOOTED
      2015-07-03 16:49:39,137 INFO org.apache.hadoop.yarn.server.resourcemanager.rmnode.RMNodeImpl: 10.208.132.153:8041 Node Transitioned from RUNNING to REBOOTED
      

      The node(10.208.132.153) reconnected with RM. When it registered with RM, RM set its lastNodeHeartbeatResponse's id to 0 asynchronously. But the node's heartbeat come before RM succeeded setting the id to 0.

      Attachments

        1. YARN-3896.07.patch
          9 kB
          Jun Gong
        2. YARN-3896.06.patch
          9 kB
          Jun Gong
        3. YARN-3896.05.patch
          10 kB
          Jun Gong
        4. YARN-3896.04.patch
          10 kB
          Jun Gong
        5. YARN-3896.03.patch
          10 kB
          Jun Gong
        6. YARN-3896.02.patch
          6 kB
          Jun Gong
        7. YARN-3896.01.patch
          3 kB
          Jun Gong
        8. 0001-YARN-3896.patch
          9 kB
          Rohith Sharma K S

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            hex108 Jun Gong Assign to me
            hex108 Jun Gong
            Votes:
            0 Vote for this issue
            Watchers:
            12 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment