Uploaded image for project: 'Mesos'
  1. Mesos
  2. MESOS-9957

Sequence all operations on the agent

Attach filesAttach ScreenshotAdd voteVotersWatch issueWatchersLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Task
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • agent

    Description

      The resolution of MESOS-8582 requires that an asynchronous step be added to the code path which applies speculative operations like RESERVE and CREATE on the agent. In order to ensure that the FrameworkInfo associated with an incoming operation will be successfully retained, we must first unschedule GC on the framework meta directory if the framework struct does not exist but that directory does. By introducing this asynchronous step, we allow the possibility that an operation may be executed out-of-order with respect to an incoming dependent LAUNCH or LAUNCH_GROUP.

      For example, if a scheduler issues an ACCEPT call containing both a RESERVE operation as well as a LAUNCH operation containing a task which consumes the new reserved resources, it's possible that this task will be launched on the agent before the reserved resources exist.

      While we already sequence task launches on a per-executor basis, the aforementioned corner case requires that we sequence all offer operations on a per-framework basis.

      Attachments

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            Unassigned Unassigned
            greggomann Greg Mann

            Dates

              Created:
              Updated:

              Agile

                Completed Sprint:
                Foundations: RI-18 55 ended 25/Sep/19
                View on Board

                Slack

                  Issue deployment