Details
-
Sub-task
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
YARN-5355, YARN-5355-branch-2
-
None
-
Reviewed
Description
Null Pointer Exception seen in ResourceTrackerService#setAppCollectorsMapToResponse call
2017-06-22 22:24:01,437 WARN org.apache.hadoop.ipc.Server: IPC Server handler 49 on 8031, call org.apache.hadoop.yarn.server.api.ResourceTrackerPB.nodeHeartbeat from 10.35.172.116:44399 Call#3929 Retry#0 java.lang.NullPointerException at org.apache.hadoop.yarn.server.resourcemanager.ResourceTrackerService.setAppCollectorsMapToResponse(ResourceTrackerService.java:467) at org.apache.hadoop.yarn.server.resourcemanager.ResourceTrackerService.nodeHeartbeat(ResourceTrackerService.java:447) at org.apache.hadoop.yarn.server.api.impl.pb.service.ResourceTrackerPBServiceImpl.nodeHeartbeat(ResourceTrackerPBServiceImpl.java:68) at org.apache.hadoop.yarn.proto.ResourceTracker$ResourceTrackerService$2.callBlockingMethod(ResourceTracker.java:81) at org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:619) at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:962) at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2084) at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2080) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:415) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1645) at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2078)
It correlates to RM invoking setAppCollectorsMapToResponse and calling
AppCollectorData appCollectorData = rmApps.get(appId).getCollectorData();
If the app object is not present in the list of running app ids, then this will throw NPE.
Filing jira to fix it.