Uploaded image for project: 'Mesos'
  1. Mesos
  2. MESOS-9052

Default executor should commit suicide if it cannot receive HTTP responses for LAUNCH_NESTED_CONTAINER calls.

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 1.4.0, 1.5.0, 1.6.0, 1.7.0
    • None
    • executor
    • None

    Description

      If there is a network problem (e.g., a routing problem), it is possible that the agent has received LAUNCH_NESTED_CONTAINER calls from the default executor and launched the nested container, but the executor does not get the HTTP response. This would result in tasks stuck at TASK_STARTING forever. We should consider making the default executor commit suicide if it does not receive the response in a reasonable amount of time.

      Attachments

        Activity

          People

            gkleiman Gastón Kleiman
            chhsia0 Chun-Hung Hsiao
            Qian Zhang Qian Zhang
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated: