Description
Tasks occasionally become stuck in the `TASK_STAGING` state after launching. It appears that this affects both Docker and non-Docker tasks, especially those which start up and fail immediately. Attached is a sample of the slave log as well as screenshots from a testing cluster showing the tasks which are stuck in staging, and then a number of failed tasks which occurs after restarting the slave process. Justin Bieber is provided for scale.
This may be related to MESOS-1837, and quite possibly the same issue, but it remains unclear.
Attachments
Attachments
Issue Links
- blocks
-
MESOS-2579 0.22.1 release
- Resolved
- is related to
-
MESOS-1462 External Containerizer can leave a task indefinitely in STAGING if the `launch` fails
- Resolved
- relates to
-
MESOS-998 Slave should wait until Containerizer::update() completes successfully
- Resolved