Details
-
Sub-task
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
-
None
Description
On restarts, if an evaluator fails, we need to create a brand new EvaluatorManager for it, which calls AllocatedEvaluatorHandler. This does not really make sense as the evaluator was never really "allocated," and we should call EvaluatorFailedHandler directly. In addition, in the C# interop portion for AllocatedEvaluatorHandler, we inspect fields in the Evaluator object, which causes a NullReferenceException because on restarts, we do not initialize certain fields for the "allocated" evaluator. Perhaps there should be a new EvaluatorFailed handler specifically for restarts as well.
Attachments
Issue Links
- blocks
-
REEF-560 Add a configurable timeout for driver to recover evaluators on restart
- Resolved
- links to