Mesos / MESOS-5950

Consider request/response for reconciliation, bulk reconcile


    Details

      Description

      The current task reconciliation API has a few quirks:

      1. The master sometimes conveys information by sending nothing at all (MESOS-4050). This is very confusing in a distributed system, where a message may also fail to arrive simply because it was dropped.
      2. A framework has no way to determine when the results for a given reconciliation request are "complete". That is, after a framework sends a reconciliation request, it receives zero or more task status updates (with reason set to REASON_RECONCILIATION), but it can't easily determine how many results to expect.
      3. For efficiency (and perhaps to simplify framework logic), it might be better for the master to send a batch of task status updates together in a single message, rather than potentially tens of thousands of individual messages.

      Regarding item 2: arguably, a framework shouldn't need to know when it has seen the "complete" set of results for a reconciliation request. However, supporting a "request/reply" structure for reconciliation could simplify framework logic, especially if a framework has several timers or other triggers that may start reconciliation at the same time.
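
To make the proposal concrete, here is a minimal sketch of what a request/reply reconciliation exchange with a bulk response might look like. Everything in it is hypothetical and for illustration only: the names `ReconcileRequest`, `ReconcileResponse`, `request_id`, `HypotheticalMaster`, and `Framework`, as well as the placeholder state `"UNKNOWN"`, are assumptions and are not part of the existing Mesos API.

```python
import itertools
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass(frozen=True)
class TaskStatus:
    task_id: str
    state: str  # e.g. "TASK_RUNNING"; "UNKNOWN" is a placeholder used only in this sketch


@dataclass(frozen=True)
class ReconcileRequest:
    request_id: int  # hypothetical correlation ID chosen by the framework
    task_ids: List[str]


@dataclass(frozen=True)
class ReconcileResponse:
    request_id: int  # echoes the ID of the request being answered
    statuses: List[TaskStatus]  # all results batched into one message


class HypotheticalMaster:
    """Stand-in for the master; answers every requested task explicitly."""

    def __init__(self, known_tasks: Dict[str, str]) -> None:
        self.known_tasks = known_tasks

    def reconcile(self, request: ReconcileRequest) -> ReconcileResponse:
        # One entry per requested task, so "no answer" never carries meaning.
        statuses = [
            TaskStatus(task_id, self.known_tasks.get(task_id, "UNKNOWN"))
            for task_id in request.task_ids
        ]
        return ReconcileResponse(request.request_id, statuses)


class Framework:
    """Tracks outstanding requests so several can be in flight at once."""

    def __init__(self) -> None:
        self._next_id = itertools.count(1)
        self.pending: Dict[int, ReconcileRequest] = {}

    def new_request(self, task_ids: List[str]) -> ReconcileRequest:
        request = ReconcileRequest(next(self._next_id), list(task_ids))
        self.pending[request.request_id] = request
        return request

    def handle_response(
        self, response: ReconcileResponse
    ) -> Optional[Dict[str, str]]:
        # A response completes exactly one request; unknown or duplicate
        # responses are ignored and reported as None.
        if self.pending.pop(response.request_id, None) is None:
            return None
        return {status.task_id: status.state for status in response.statuses}
```

Under these assumptions, a framework knows a reconciliation request is complete as soon as the response with the matching `request_id` arrives, and independent reconciliation timers can each issue their own request without confusing one another's results.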


              People

              • Assignee:
                Unassigned
                Reporter:
                neilc Neil Conway