Uploaded image for project: 'Mesos'
  1. Mesos
  2. MESOS-6743

Docker executor hangs forever if `docker stop` fails.

    XMLWordPrintableJSON

    Details

    • Sprint:
      Mesosphere Sprint 61
    • Story Points:
      5

      Description

      If docker stop finishes with an error status, the executor should catch this and react instead of indefinitely waiting for reaped to return.

      An interesting question is how to react. Here are possible solutions.

      1. Retry docker stop. In this case it is unclear how many times to retry and what to do if docker stop continues to fail.

      2. Unmark task as killed. This will allow frameworks to retry the kill. However, in this case it is unclear what status updates we should send: TASK_KILLING for every kill retry? an extra update when we failed to kill a task? or set a specific reason in TASK_KILLING?

      3. Clean up and exit. In this case we should make sure the task container is killed or notify the framework and the operator that the container may still be running.

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                abudnik Andrei Budnik
                Reporter:
                alexr Alex R
                Shepherd:
                Alex R
              • Votes:
                1 Vote for this issue
                Watchers:
                8 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: