Description
Currently in Group Communication/IMRU, when one of the Evaluators fails, the parent of the task will be stuck, and so does grand parents. We need to identify all the possible cases that may cause such failures, and send the proper state back to driver.
Attachments
Issue Links
- Is contained by
-
REEF-1223 IMRU Fault Tolerance - restart failed evaluators
- Resolved