[MAPREDUCE-551] Add preemption to the fair scheduler

Details

Type: New Feature
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 0.21.0
Component/s: contrib/fair-share
Labels: None
Hadoop Flags: Reviewed
Release Note: Added support for preemption in the fair scheduler. The new configuration options for enabling this are described in the fair scheduler documentation.

Description

Task preemption is necessary in a multi-user Hadoop cluster for two reasons: users might submit long-running tasks by mistake (e.g. an infinite loop in a map program), or tasks may legitimately run long because they have to process large amounts of data. The Fair Scheduler (HADOOP-3746) has a concept of guaranteed capacity for certain queues, as well as a goal of providing good performance for interactive jobs on average through fair sharing. Therefore, it will support preemption under two conditions:
1) A job isn't getting its guaranteed share of the cluster for at least T1 seconds.
2) A job is getting significantly less than its fair share (e.g. less than half its share) for at least T2 seconds.

T1 will be chosen smaller than T2 (and will be configurable per queue) to meet guarantees quickly. T2 is meant as a last resort in case non-critical jobs in queues with no guaranteed capacity are being starved.
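
The two conditions above can be sketched as a small per-job tracker. This is only an illustration of the idea, not the code from the attached patches: the class `StarvationTracker`, its method names, and the use of plain second counters are all assumptions, and the sketch ignores details such as a job whose demand is smaller than its share.

```java
/**
 * Hypothetical sketch of the two preemption triggers described above.
 * Class and method names are illustrative and are not taken from the patch.
 */
final class StarvationTracker {
    private final long t1Seconds;        // guaranteed-share timeout (T1)
    private final long t2Seconds;        // fair-share timeout (T2), expected to be larger than T1
    private long lastAtGuaranteedShare;  // last time the job held its guaranteed share
    private long lastAtHalfFairShare;    // last time the job held at least half its fair share

    StarvationTracker(long t1Seconds, long t2Seconds, long now) {
        this.t1Seconds = t1Seconds;
        this.t2Seconds = t2Seconds;
        this.lastAtGuaranteedShare = now;
        this.lastAtHalfFairShare = now;
    }

    /** Record the job's current allocation; a satisfied condition resets its clock. */
    void update(int runningTasks, int guaranteedShare, double fairShare, long now) {
        if (runningTasks >= guaranteedShare) {
            lastAtGuaranteedShare = now;
        }
        if (runningTasks >= fairShare / 2.0) {
            lastAtHalfFairShare = now;
        }
    }

    /** Condition 1: below the guaranteed share for at least T1 seconds. */
    boolean guaranteedShareTimedOut(long now) {
        return now - lastAtGuaranteedShare >= t1Seconds;
    }

    /** Condition 2: below half the fair share for at least T2 seconds. */
    boolean fairShareTimedOut(long now) {
        return now - lastAtHalfFairShare >= t2Seconds;
    }

    boolean shouldPreempt(long now) {
        return guaranteedShareTimedOut(now) || fairShareTimedOut(now);
    }
}
```

In this sketch the scheduler would call `update` on every scheduling pass and `shouldPreempt` when deciding whether to look for tasks to kill. Keeping two independent clocks lets T1 fire quickly for guaranteed queues while T2 remains a slower fallback.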

When deciding which tasks to kill to make room for the job, we will use the following heuristics:

- Look for tasks to kill only in jobs that have more than their fair share, ordering these by deficit (most overscheduled jobs first).
- For maps: kill tasks that have run for the least amount of time (limiting wasted time).
- For reduces: similar to maps, but give extra preference for reduces in the copy phase where there is not much map output per task (at Facebook, we have observed this to be the main time we need preemption - when a job has a long map phase and its reducers are mostly sitting idle and filling up slots).
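
A minimal sketch of the map-task heuristics follows. Everything in it is hypothetical: `VictimSelector`, `Job`, and `Task` are illustrative names, "deficit" is approximated as running tasks minus fair share, and capping each job's victims at its excess (so it is not pushed below its fair share) is an added assumption. The extra preference for copy-phase reduces is not modeled.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

/** Hypothetical sketch of victim selection; not the patch's actual API. */
final class VictimSelector {
    static final class Task {
        final String id;
        final long startTime; // when the task started running
        Task(String id, long startTime) {
            this.id = id;
            this.startTime = startTime;
        }
    }

    static final class Job {
        final String name;
        final double fairShare;
        final List<Task> runningTasks;
        Job(String name, double fairShare, List<Task> runningTasks) {
            this.name = name;
            this.fairShare = fairShare;
            this.runningTasks = runningTasks;
        }
        /** How many slots the job holds beyond its fair share (assumed proxy for deficit). */
        double excess() {
            return runningTasks.size() - fairShare;
        }
    }

    /** Pick up to tasksNeeded tasks to kill, taking only from jobs above their fair share. */
    static List<Task> selectVictims(List<Job> jobs, int tasksNeeded) {
        List<Job> overScheduled = new ArrayList<>();
        for (Job job : jobs) {
            if (job.excess() > 0) {
                overScheduled.add(job);
            }
        }
        // Most overscheduled jobs first.
        overScheduled.sort(Comparator.comparingDouble(Job::excess).reversed());

        List<Task> victims = new ArrayList<>();
        for (Job job : overScheduled) {
            List<Task> tasks = new ArrayList<>(job.runningTasks);
            // Most recently started first, i.e. least running time and least wasted work.
            tasks.sort(Comparator.comparingLong((Task t) -> t.startTime).reversed());
            int canTake = (int) Math.floor(job.excess()); // assumption: never go below fair share
            for (Task task : tasks) {
                if (victims.size() >= tasksNeeded || canTake <= 0) {
                    break;
                }
                victims.add(task);
                canTake--;
            }
        }
        return victims;
    }
}
```

Sorting by start time rather than by measured progress keeps the sketch simple; a real scheduler would likely have richer task state to draw on.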

Attachments

- fairshare-patches.tar.gz, 10/Oct/09 01:14, 31 kB, Todd Lipcon
- mapreduce-551-branch20.txt, 21/Jul/09 06:14, 127 kB, Todd Lipcon
- hadoop-4665-v7e.patch, 24/Jun/09 21:50, 132 kB, Matei Alexandru Zaharia
- hadoop-4665-v7d.patch, 23/Jun/09 23:07, 131 kB, Matei Alexandru Zaharia
- hadoop-4665-v7c.patch, 19/Jun/09 01:24, 131 kB, Matei Alexandru Zaharia
- hadoop-4665-v7b.patch, 05/Jun/09 21:43, 130 kB, Matei Alexandru Zaharia
- hadoop-4665-v7.patch, 02/Jun/09 05:40, 130 kB, Matei Alexandru Zaharia
- hadoop-4665-v6.patch, 16/May/09 21:39, 98 kB, Matei Alexandru Zaharia
- hadoop-4665-v5.patch, 07/May/09 05:26, 98 kB, Matei Alexandru Zaharia
- hadoop-4665-v4.patch, 25/Mar/09 00:01, 44 kB, Matei Alexandru Zaharia
- hadoop-4665-v3.patch, 19/Mar/09 22:35, 44 kB, Matei Alexandru Zaharia
- hadoop-4665-v2.patch, 21/Feb/09 08:14, 44 kB, Matei Alexandru Zaharia
- hadoop-4665-v1b.patch, 09/Feb/09 00:01, 45 kB, Matei Alexandru Zaharia
- hadoop-4665-v1.patch, 08/Feb/09 23:48, 45 kB, Matei Alexandru Zaharia
- fs-preemption-v0.patch, 08/Jan/09 06:42, 57 kB, Matei Alexandru Zaharia

Issue Links

is duplicated by

HADOOP-5701 With fair scheduler, long running jobs can easily occurpy a lot of task slots

Resolved

People

Assignee: Matei Alexandru Zaharia

Reporter: Matei Alexandru Zaharia

Votes: 2

Watchers: 22

Dates

Created: 15/Nov/08 23:51

Updated: 24/Aug/10 21:13

Resolved: 27/Jun/09 03:46