Description
It seems that certain combinations of the container sizes launched by the AM cause the AM to keep restarting.
The following exception is seen in the app master container log:
Aug 12, 2016 4:37:39 PM com.google.common.util.concurrent.AbstractExecutionThreadService$1$1 run
WARNING: Error while attempting to shut down the service after failure.
java.lang.NullPointerException
    at org.apache.hadoop.yarn.client.api.impl.AMRMClientImpl.decResourceRequest(AMRMClientImpl.java:687)
    at org.apache.hadoop.yarn.client.api.impl.AMRMClientImpl.removeContainerRequest(AMRMClientImpl.java:477)
    at org.apache.twill.internal.yarn.Hadoop21YarnAMClient.removeContainerRequest(Hadoop21YarnAMClient.java:116)
    at org.apache.twill.internal.yarn.Hadoop21YarnAMClient.removeContainerRequest(Hadoop21YarnAMClient.java:45)
    at org.apache.twill.internal.yarn.AbstractYarnAMClient.allocate(AbstractYarnAMClient.java:119)
    at org.apache.twill.internal.appmaster.ApplicationMasterService.doStop(ApplicationMasterService.java:281)
    at org.apache.twill.internal.AbstractTwillService.shutDown(AbstractTwillService.java:186)
    at com.google.common.util.concurrent.AbstractExecutionThreadService$1$1.run(AbstractExecutionThreadService.java:55)
    at java.lang.Thread.run(Thread.java:745)
Exception in thread "ApplicationMasterService" java.lang.NullPointerException
    at org.apache.hadoop.yarn.client.api.impl.AMRMClientImpl.decResourceRequest(AMRMClientImpl.java:687)
    at org.apache.hadoop.yarn.client.api.impl.AMRMClientImpl.removeContainerRequest(AMRMClientImpl.java:477)
    at org.apache.twill.internal.yarn.Hadoop21YarnAMClient.removeContainerRequest(Hadoop21YarnAMClient.java:116)
    at org.apache.twill.internal.yarn.Hadoop21YarnAMClient.removeContainerRequest(Hadoop21YarnAMClient.java:45)
    at org.apache.twill.internal.yarn.AbstractYarnAMClient.allocate(AbstractYarnAMClient.java:119)
    at org.apache.twill.internal.appmaster.ApplicationMasterService.doRun(ApplicationMasterService.java:369)
    at org.apache.twill.internal.AbstractTwillService.run(AbstractTwillService.java:179)
    at com.google.common.util.concurrent.AbstractExecutionThreadService$1$1.run(AbstractExecutionThreadService.java:52)
    at java.lang.Thread.run(Thread.java:745)