Uploaded image for project: 'Mesos'
  1. Mesos
  2. MESOS-9318

Consider providing better operation status updates while an RP is recovering

    XMLWordPrintableJSON

Details

    Description

      Consider the following scenario:

      1. A framework accepts an offer with an operation affecting SLRP resources.
      2. The master forwards it to the corresponding agent.
      3. The agent forwards it to the corresponding RP.
      4. The agent and the master fail over.
      5. The master recovers.
      6. The agent recovers while the RP is still recovering, so it doesn't include the pending operation on the RegisterMessage.
      7. A framework performs an explicit operation status reconciliation.

      In this case the master will currently respond with OPERATION_UNKNOWN, but it should be possible to respond with a more fine-grained and useful state, such as OPERATION_RECOVERING.

      Attachments

        Issue Links

          Activity

            People

              greggomann Greg Mann
              gkleiman Gastón Kleiman
              Gastón Kleiman Gastón Kleiman
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: