[MESOS-8725] Support max_duration for tasks - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 1.6.0
Component/s: None
Labels:
None

Target Version/s:

1.6.0

Description

In our environment, we run a lot of batch jobs, some of which have tight timeline. If any tasks in the job runs longer than x hours, it does not make sense to run it anymore.

For instance, a team would submit a job which builds a weekly index and repeats every Monday. If the job does not finish before next Monday for whatever reason, there is no point to keep any task running.

We believe that implementing deadline tracking distributed across our cluster makes more sense as it makes the system more scalable and also makes our centralized state machine simpler.

One idea I have right now is to add an optional TimeInfo deadline to TaskInfo field, and all default executors in Mesos can simply terminate the task and send a proper StatusUpdate.

Attachments

Activity

People

Assignee:: Zhitao Li

Reporter:: Zhitao Li

Shepherd:: James Peach

Votes:: 0 Vote for this issue

Watchers:: 5 Start watching this issue

Dates

Created:: 22/Mar/18 17:05

Updated:: 03/May/18 16:18

Resolved:: 03/May/18 16:18