[MESOS-810] New Scheduler API - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Story
Status: Resolved
Priority: Major
Resolution: Incomplete
Affects Version/s: None
Fix Version/s: None
Component/s: c++ api, java api, python api
Labels:
None

Description

This ticket is an effort to capture requirements and link to related tickets for the future version of the Scheduler API. We should split these out as needed but for now I'll just document things off the top of my head:

1. Batch status update acknowledgements. The current mechanism for acknowledging a status update is for the call to Scheduler::statusUpdate to return. At this point we send an acknowledgement to the slave. This simplistic approach forces schedulers to serially persist status updates if operating in a stateful manner, ultimately leading to scaling issues.

2. Explicit behavior. By this I mean that when a Scheduler calls something like killTask, we'll currently implicitly drop it when we're disconnected from the Master. Rather than implicitly doing this, we either need to deliver things reliably or inform schedulers when their request was dropped. Returning Futures could be very powerful here.

3. Statefulness and access to state. The scheduler driver is currently stateless (no persistence). This means schedulers currently have to persist state using our State abstraction or the replicated log, or through their own persistence mechanism. Providing a stateful scheduler driver increases the simplicity of framework schedulers substantially, and providing access to state makes it very simple to implement a framework.

Attachments

Issue Links

relates to

MESOS-2288 Scheduler API v1

Resolved

MESOS-2347 Add ability for schedulers to explicitly acknowledge status updates on the driver.

Resolved

Activity

People

Assignee:: Unassigned

Reporter:: Benjamin Mahler

Votes:: 1 Vote for this issue

Watchers:: 9 Start watching this issue

Dates

Created:: 12/Nov/13 22:35

Updated:: 28/Feb/16 22:40

Resolved:: 06/Apr/15 18:47