Details
-
Bug
-
Status: Closed
-
Major
-
Resolution: Fixed
-
0.23.2
-
None
Description
We had an instance where the RM went down for more then an hour. The application master exited with "Could not contact RM after 360000 milliseconds"
2012-04-11 10:43:36,040 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.JobImpl: job_1333003059741_15999Job Transitioned from RUNNING to ERROR