Mesos / MESOS-8125

Agent should properly handle recovering an executor when its pid is reused

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.4.2, 1.5.1, 1.6.0
    • Component/s: docker
    • Labels: None
    • Target Version/s:
    • Sprint: Mesosphere Sprint 73
    • Story Points: 2

      Description

      Here's how to reproduce this issue:

      1. Start a task using the Docker containerizer (the same will probably happen with the command executor).
      2. Stop the corresponding Mesos agent while the task is running.
      3. Change the executor's checkpointed forked pid, which is located in the meta directory, e.g., /var/lib/mesos/slave/meta/slaves/latest/frameworks/19faf6e0-3917-48ab-8b8e-97ec4f9ed41e-0001/executors/foo.13faee90-b5f0-11e7-8032-e607d2b4348c/runs/latest/pids/forked.pid. I used pid 2, which is normally used by kthreadd.
      4. Reboot the host.
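Step 3 can be scripted. The following is a minimal sketch, not part of the original report: the helper name `overwrite_forked_pid` is hypothetical, and it assumes the checkpointed `forked.pid` file contains nothing but the pid as decimal text. Run it only while the agent is stopped.

```python
from pathlib import Path


def overwrite_forked_pid(pid_file: Path, new_pid: int) -> int:
    """Replace the pid stored in pid_file and return the previous value.

    Assumption: the file holds a single decimal pid as plain text.
    """
    if new_pid <= 0:
        raise ValueError("pid must be a positive integer")

    # Read the pid that the agent checkpointed when it forked the executor.
    old_pid = int(pid_file.read_text().strip())

    # Overwrite it with the pid we want the agent to see during recovery.
    pid_file.write_text(str(new_pid))
    return old_pid
```

For the scenario above, the call would take the path to the executor's `forked.pid` under the agent's meta directory and `2` as the new pid, so that after the reboot the checkpointed pid refers to an unrelated process (normally kthreadd).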

              People

              • Assignee: Qian Zhang (qianzhang)
              • Reporter: Gastón Kleiman (gkleiman)
              • Shepherd: Vinod Kone
              • Votes: 0
              • Watchers: 5
