Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-5355 YARN Timeline Service v.2: alpha 2
  3. YARN-6801

NPE in RM while setting collectors map in NodeHeartbeatResponse

    XMLWordPrintableJSON

    Details

    • Hadoop Flags:
      Reviewed

      Description

      Null Pointer Exception seen in ResourceTrackerService#setAppCollectorsMapToResponse call

      2017-06-22 22:24:01,437 WARN org.apache.hadoop.ipc.Server: IPC Server handler 49 on 8031, call org.apache.hadoop.yarn.server.api.ResourceTrackerPB.nodeHeartbeat from 10.35.172.116:44399 Call#3929 Retry#0
      java.lang.NullPointerException
              at org.apache.hadoop.yarn.server.resourcemanager.ResourceTrackerService.setAppCollectorsMapToResponse(ResourceTrackerService.java:467)
              at org.apache.hadoop.yarn.server.resourcemanager.ResourceTrackerService.nodeHeartbeat(ResourceTrackerService.java:447)
              at org.apache.hadoop.yarn.server.api.impl.pb.service.ResourceTrackerPBServiceImpl.nodeHeartbeat(ResourceTrackerPBServiceImpl.java:68)
              at org.apache.hadoop.yarn.proto.ResourceTracker$ResourceTrackerService$2.callBlockingMethod(ResourceTracker.java:81)
              at org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:619)
              at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:962)
              at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2084)
              at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2080)
              at java.security.AccessController.doPrivileged(Native Method)
              at javax.security.auth.Subject.doAs(Subject.java:415)
              at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1645)
              at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2078)
      

      It correlates to RM invoking setAppCollectorsMapToResponse and calling

            AppCollectorData appCollectorData = rmApps.get(appId).getCollectorData();
      

      If the app object is not present in the list of running app ids, then this will throw NPE.

      Filing jira to fix it.

        Attachments

          Activity

            People

            • Assignee:
              vrushalic Vrushali C
              Reporter:
              vrushalic Vrushali C
            • Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: