Uploaded image for project: 'Mesos'
  1. Mesos
  2. MESOS-6743

Docker executor hangs forever if `docker stop` fails.

Attach filesAttach ScreenshotVotersWatch issueWatchersLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Mesosphere Sprint 61
    • 5

    Description

      If docker stop finishes with an error status, the executor should catch this and react instead of indefinitely waiting for reaped to return.

      An interesting question is how to react. Here are possible solutions.

      1. Retry docker stop. In this case it is unclear how many times to retry and what to do if docker stop continues to fail.

      2. Unmark task as killed. This will allow frameworks to retry the kill. However, in this case it is unclear what status updates we should send: TASK_KILLING for every kill retry? an extra update when we failed to kill a task? or set a specific reason in TASK_KILLING?

      3. Clean up and exit. In this case we should make sure the task container is killed or notify the framework and the operator that the container may still be running.

      Attachments

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            abudnik Andrei Budnik
            alexr Alex R
            Alex R Alex R
            Votes:
            1 Vote for this issue
            Watchers:
            7 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Agile

                Completed Sprint:
                Mesosphere Sprint 61 ended 18/Aug/17
                View on Board

                Slack

                  Issue deployment