Uploaded image for project: 'Mesos'
  1. Mesos
  2. MESOS-8125

Agent should properly handle recovering an executor when its pid is reused

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Critical
    • Resolution: Fixed
    • None
    • 1.4.2, 1.5.1, 1.6.0
    • docker
    • None
    • Mesosphere Sprint 73
    • 2

    Description

      Here's how to reproduce this issue:

      1. Start a task using the Docker containerizer (the same will probably happen with the command executor).
      2. Stop the corresponding Mesos agent while the task is running.
      3. Change the executor's checkpointed forked pid, which is located in the meta directory, e.g., /var/lib/mesos/slave/meta/slaves/latest/frameworks/19faf6e0-3917-48ab-8b8e-97ec4f9ed41e-0001/executors/foo.13faee90-b5f0-11e7-8032-e607d2b4348c/runs/latest/pids/forked.pid. I used pid 2, which is normally used by kthreadd.
      4. Reboot the host

      Attachments

        Issue Links

          Activity

            People

              qianzhang Qian Zhang
              gkleiman Gastón Kleiman
              Vinod Kone Vinod Kone
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: