Details
Type: Improvement
Status: Open
Priority: Minor
Resolution: Unresolved
Description
YARN sometimes kills containers, most commonly when they exceed their resource limits. Today, REEF gives no indication in the FailedEvaluator message of why the Evaluator was lost. We should investigate whether we can do a better job here: for example, if YARN provides a reason for the kill, we could relay it.
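
As a possible starting point for that investigation, here is a minimal sketch of how a kill reason might be derived. It assumes that the YARN `ContainerStatus` record reported to the application master carries an exit status (`getExitStatus()`) and a diagnostics string (`getDiagnostics()`), and that the exit codes follow the constants in `org.apache.hadoop.yarn.api.records.ContainerExitStatus`. The `KillReason` class and its `describe` method are hypothetical and not part of REEF. The numeric values are copied here only to keep the sketch self-contained and should be checked against the Hadoop version in use.

```java
// Hypothetical helper, not part of REEF. The exit codes below mirror
// org.apache.hadoop.yarn.api.records.ContainerExitStatus (Hadoop 2.x);
// verify them against the Hadoop version actually deployed.
final class KillReason {
  static final int ABORTED = -100;
  static final int DISKS_FAILED = -101;
  static final int PREEMPTED = -102;
  static final int KILLED_EXCEEDED_VMEM = -103;
  static final int KILLED_EXCEEDED_PMEM = -104;

  private KillReason() {
  }

  // Maps a container exit status plus optional diagnostics text to a
  // human-readable reason that could be attached to a FailedEvaluator.
  static String describe(final int exitStatus, final String diagnostics) {
    final String reason;
    switch (exitStatus) {
      case KILLED_EXCEEDED_PMEM:
        reason = "Container exceeded its physical memory limit";
        break;
      case KILLED_EXCEEDED_VMEM:
        reason = "Container exceeded its virtual memory limit";
        break;
      case PREEMPTED:
        reason = "Container was preempted";
        break;
      case DISKS_FAILED:
        reason = "Container failed because of disk failures";
        break;
      case ABORTED:
        reason = "Container was aborted by the framework";
        break;
      default:
        reason = "Container exited with status " + exitStatus;
        break;
    }
    // Append YARN's own diagnostics only when there is something to relay.
    if (diagnostics == null || diagnostics.trim().isEmpty()) {
      return reason;
    }
    return reason + ": " + diagnostics.trim();
  }
}
```

In the YARN runtime, something like `KillReason.describe(status.getExitStatus(), status.getDiagnostics())` could then be evaluated when a completed container is reported, and the result relayed with the FailedEvaluator. How that text would be surfaced in the FailedEvaluator API is left open here.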