Details
Type: Bug
Status: Resolved
Priority: Major
Resolution: Fixed
Description
The Driver should ACK the Evaluator's DONE heartbeat so that a race condition does not occur when the Evaluator ends. The race arises as follows: the Evaluator heartbeats DONE back to the Driver and then exits, and the RM notices that the Evaluator process has exited. The RM may then report to the Driver that the Evaluator is DONE before the Evaluator's DONE heartbeat reaches the Driver, causing the Driver to invoke the FailedEvaluatorHandler because the DONE message from the RM is unexpected.
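The race and the proposed ACK can be illustrated with a small, deterministic simulation. This is only a sketch under stated assumptions, not the actual fix: the class `DoneAckSketch`, the `Driver` model, the method names, and the "CompletedEvaluatorHandler" label are all hypothetical and do not correspond to the real API. The handlers are represented as plain strings, and message ordering is fixed by hand instead of using real threads or a network.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical, simplified model of the DONE-heartbeat race; not the real Driver. */
public class DoneAckSketch {

    /** Lifecycle of one Evaluator as seen by the (hypothetical) Driver. */
    public enum State { RUNNING, COMPLETED, FAILED }

    /** Hypothetical Driver-side bookkeeping for a single Evaluator. */
    public static final class Driver {
        public State state = State.RUNNING;
        public final List<String> handlersInvoked = new ArrayList<>();

        /** The Evaluator's DONE heartbeat arrives. Returns true as the ACK. */
        public boolean onDoneHeartbeat() {
            if (state != State.RUNNING) {
                return false; // too late: the Evaluator was already closed out
            }
            state = State.COMPLETED;
            handlersInvoked.add("CompletedEvaluatorHandler"); // assumed name
            return true;
        }

        /** The RM reports that the Evaluator process has exited. */
        public void onResourceManagerDone() {
            if (state != State.RUNNING) {
                return; // expected: the DONE heartbeat was already processed
            }
            // No DONE heartbeat seen yet, so the exit looks unexpected.
            state = State.FAILED;
            handlersInvoked.add("FailedEvaluatorHandler");
        }
    }

    /** Without an ACK the Evaluator exits right after sending DONE,
     *  so the RM report can overtake the heartbeat (the racy ordering). */
    public static void runWithoutAck(Driver driver) {
        driver.onResourceManagerDone(); // RM report arrives first
        driver.onDoneHeartbeat();       // heartbeat arrives late and is dropped
    }

    /** With an ACK the Evaluator exits only after the Driver has confirmed
     *  the DONE heartbeat, so the RM report can only arrive afterwards. */
    public static void runWithAck(Driver driver) {
        boolean acked = driver.onDoneHeartbeat(); // Evaluator waits for this
        if (acked) {
            driver.onResourceManagerDone();       // process exit is seen after the ACK
        }
    }

    public static void main(String[] args) {
        Driver racy = new Driver();
        runWithoutAck(racy);
        System.out.println("without ACK: " + racy.handlersInvoked);

        Driver safe = new Driver();
        runWithAck(safe);
        System.out.println("with ACK:    " + safe.handlersInvoked);
    }
}
```

In this model the ACK removes the race because the Evaluator process cannot exit, and therefore the RM cannot report it as DONE, until the Driver has already recorded the DONE heartbeat.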