Details
- Type: Improvement
- Status: Open
- Priority: Major
- Resolution: Unresolved
- Affects Version/s: 1.0.2, 1.1.0
- Fix Version/s: None
Description
At least the following reasons can lead to a task being killed:
- The executor decided to kill it, e.g., if it was unhealthy.
- The framework requested the kill.
- An operator requested the kill.
The executors bundled with Mesos don't set a reason in the TaskStatus message, making it hard to debug task kills.
I think that we should extend the TaskStatus.Reason enum with reasons for those three cases.
In the case of health check failures, the executors should also populate the message field with useful debugging information.
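To make the proposal concrete, here is a minimal standalone sketch of what a kill status carrying a reason and a debugging message could look like. It does not use the Mesos API: `KillReason`, `KillStatus`, `make_kill_status`, and the three reason names are hypothetical and chosen only for illustration; they are not existing `TaskStatus.Reason` values.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class KillReason(Enum):
    """Hypothetical kill reasons, one per case listed in the description."""
    KILLED_BY_EXECUTOR = "executor"    # e.g., the task was unhealthy
    KILLED_BY_FRAMEWORK = "framework"  # the framework requested the kill
    KILLED_BY_OPERATOR = "operator"    # an operator requested the kill


@dataclass
class KillStatus:
    """Simplified stand-in for a TaskStatus update (not the real message)."""
    task_id: str
    state: str
    reason: KillReason
    message: str = ""


def make_kill_status(
    task_id: str,
    reason: KillReason,
    failed_checks: Optional[int] = None,
    last_error: Optional[str] = None,
) -> KillStatus:
    """Build a TASK_KILLED status; add health check details when available."""
    message = ""
    if reason is KillReason.KILLED_BY_EXECUTOR and failed_checks is not None:
        message = f"Health check failed {failed_checks} consecutive time(s)"
        if last_error:
            message += f": {last_error}"
    return KillStatus(task_id, "TASK_KILLED", reason, message)
```

With something along these lines, a consumer of the status update could tell an operator-initiated kill from a health check kill by inspecting `reason`, and read `message` for the failing check's details.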
Issue Links
- is related to: MESOS-1930 Expose TASK_KILLED reason. (Accepted)
- relates to: MESOS-6905 Task status updates caused by task health update do not set appropriate reason. (Resolved)