Uploaded image for project: 'Mesos'
  1. Mesos
  2. MESOS-1466

Race between executor exited event and launch task can cause overcommit of resources

    XMLWordPrintableJSON

Details

    Description

      The following sequence of events can cause an overcommit

      --> Launch task is called for a task whose executor is already running

      --> Executor's resources are not accounted for on the master

      --> Executor exits and the event is enqueued behind launch tasks on the master

      --> Master sends the task to the slave which needs to commit for resources for task and the (new) executor.

      --> Master processes the executor exited event and re-offers the executor's resources causing an overcommit of resources.

      Attachments

        Issue Links

          Activity

            People

              mzhu Meng Zhu
              vinodkone Vinod Kone
              Vinod Kone Vinod Kone
              Votes:
              0 Vote for this issue
              Watchers:
              8 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: