Type: Bug
Status: Resolved
Priority: Critical
Resolution: Fixed
Affects Version/s: None
Component/s: capacity scheduler
Labels: None
Target Version/s:
Hadoop Flags: Reviewed
If we use the following queue mapping:
u:%user:%primary_group
then we get an NPE inside the ResourceManager:
2020-04-06 11:59:13,883 ERROR resourcemanager.ResourceManager (ResourceManager.java:serviceStart(881)) - Failed to load/recover state
java.lang.NullPointerException
    at java.util.concurrent.ConcurrentHashMap.get(ConcurrentHashMap.java:936)
    at org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.CapacitySchedulerQueueManager.getQueue(CapacitySchedulerQueueManager.java:138)
    at org.apache.hadoop.yarn.server.resourcemanager.placement.UserGroupMappingPlacementRule.getContextForPrimaryGroup(UserGroupMappingPlacementRule.java:163)
    at org.apache.hadoop.yarn.server.resourcemanager.placement.UserGroupMappingPlacementRule.getPlacementForUser(UserGroupMappingPlacementRule.java:118)
    at org.apache.hadoop.yarn.server.resourcemanager.placement.UserGroupMappingPlacementRule.getPlacementForApp(UserGroupMappingPlacementRule.java:227)
    at org.apache.hadoop.yarn.server.resourcemanager.placement.PlacementManager.placeApplication(PlacementManager.java:67)
    at org.apache.hadoop.yarn.server.resourcemanager.RMAppManager.placeApplication(RMAppManager.java:827)
    at org.apache.hadoop.yarn.server.resourcemanager.RMAppManager.createAndPopulateNewRMApp(RMAppManager.java:378)
    at org.apache.hadoop.yarn.server.resourcemanager.RMAppManager.recoverApplication(RMAppManager.java:367)
    at org.apache.hadoop.yarn.server.resourcemanager.RMAppManager.recover(RMAppManager.java:594)
    ...
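We need to check whether the parent queue is null in UserGroupMappingPlacementRule.getContextForPrimaryGroup(). The mapping above specifies no parent queue, so a null queue name reaches CapacitySchedulerQueueManager.getQueue(), and ConcurrentHashMap.get() throws on a null key.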
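The failure mode and the kind of guard proposed here can be illustrated with a minimal, self-contained sketch. This is not the actual YARN code or the committed patch: the class ParentQueueNullCheckSketch, the QUEUES map, and the methods lookupUnguarded and resolvePrimaryGroupQueue are hypothetical stand-ins, and the real placement rule works with queue and context objects rather than plain strings. The only behavior taken as fact is that ConcurrentHashMap.get(null) throws a NullPointerException.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical sketch; names do not correspond to real YARN classes.
class ParentQueueNullCheckSketch {
    // Stand-in for the scheduler's queue registry (queue name -> queue path).
    static final Map<String, String> QUEUES = new ConcurrentHashMap<>();

    // Mirrors the failing path: ConcurrentHashMap rejects null keys,
    // so passing a null queue name throws NullPointerException.
    static String lookupUnguarded(String queueName) {
        return QUEUES.get(queueName);
    }

    // Guarded variant: only touch the map with the parent name when one
    // was actually configured in the mapping rule.
    static String resolvePrimaryGroupQueue(String parentQueue, String group) {
        if (parentQueue == null) {
            // Mapping such as u:%user:%primary_group has no parent queue:
            // resolve the group name directly as a queue.
            return QUEUES.containsKey(group) ? group : null;
        }
        // A parent queue was given: it must exist before we place under it.
        if (!QUEUES.containsKey(parentQueue)) {
            return null;
        }
        return parentQueue + "." + group;
    }

    public static void main(String[] args) {
        QUEUES.put("hadoop", "root.hadoop");
        QUEUES.put("managed", "root.managed");

        try {
            lookupUnguarded(null);
        } catch (NullPointerException e) {
            System.out.println("unguarded lookup threw NullPointerException");
        }
        System.out.println(resolvePrimaryGroupQueue(null, "hadoop"));
        System.out.println(resolvePrimaryGroupQueue("managed", "hadoop"));
    }
}
```

Returning null for an unresolvable queue is a choice made for this sketch only; how the real rule should react when no queue can be resolved is up to the actual patch.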
Is related to: YARN-10198 [managedParent].%primary_group mapping rule doesn't work after YARN-9868 (Resolved)