Details
-
Improvement
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
Description
Right now, the FAILURE event doesn't provide much context about why the slave was removed and/or why the executor terminated. A reason field, similar to TaskStatus.reason, would help users differentiate (e.g., between slaves that are removed for maintenance versus those that are removed because of a network partition; or between the various different executor termination scenarios).
Attachments
Issue Links
- is related to
-
MESOS-4548 Errors communicated to the scheduler should be associated with stable error codes.
- Open