[MAPREDUCE-5066] JobTracker should set a timeout when calling into job.end.notification.url - ASF JIRA

Details

Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: 1-win, 2.0.3-alpha, 1.3.0
Fix Version/s: 1.2.0, 1-win, 2.1.0-beta
Component/s: None
Labels: None
Target Version/s: 1.2.0, 1-win, 2.1.0-beta

Description

In the current code, no timeout is specified when the JobTracker (JobEndNotifier) calls the notification URL. If the URL points to a server that does not respond for a long time, job notifications get completely stuck, because a single thread processes all notifications. We have seen this cause noticeable delays in job execution in components that rely on job end notifications, such as Oozie workflows.

I propose we introduce a configurable timeout option and set its default to a reasonably small value.
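
A minimal sketch of what such a timeout could look like, not the attached patch. The property name `job.end.notification.timeout`, the 5000 ms default, and the class and method names are assumptions made for illustration. Only `HttpURLConnection.setConnectTimeout` and `setReadTimeout` are standard JDK calls.

```java
import java.io.IOException;
import java.net.HttpURLConnection;
import java.net.URI;
import java.util.Properties;

final class NotificationTimeoutSketch {
    // Hypothetical property name and default; the real patch may use different ones.
    static final String TIMEOUT_KEY = "job.end.notification.timeout";
    static final int DEFAULT_TIMEOUT_MS = 5000;

    // Reads the timeout from configuration, falling back to the default
    // when the value is missing, malformed, or not positive.
    static int timeoutMillis(Properties conf) {
        String raw = conf.getProperty(TIMEOUT_KEY);
        if (raw == null) {
            return DEFAULT_TIMEOUT_MS;
        }
        try {
            int value = Integer.parseInt(raw.trim());
            return value > 0 ? value : DEFAULT_TIMEOUT_MS;
        } catch (NumberFormatException e) {
            return DEFAULT_TIMEOUT_MS;
        }
    }

    // Calls the notification URL once. Both the connect and the read phase
    // are bounded, so an unresponsive server raises SocketTimeoutException
    // instead of blocking the calling thread indefinitely.
    static int notifyOnce(String url, int timeoutMs) throws IOException {
        HttpURLConnection conn =
            (HttpURLConnection) URI.create(url).toURL().openConnection();
        conn.setConnectTimeout(timeoutMs);
        conn.setReadTimeout(timeoutMs);
        try {
            return conn.getResponseCode();
        } finally {
            conn.disconnect();
        }
    }
}
```

With this shape, a caller would catch the `IOException` from `notifyOnce`, log it, and move on to the next queued notification, so one dead endpoint costs at most roughly the connect timeout plus the read timeout.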

If we want, we can also introduce a configurable number of workers processing the notification queue, though I am not sure this is needed at this point.
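
For illustration only, the multi-worker idea could be sketched with a fixed-size thread pool. The class name, the `workerCount` parameter, and the use of `ExecutorService` are assumptions; nothing in this issue says the JobTracker would be changed this way.

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

final class NotificationWorkersSketch {
    private final ExecutorService workers;

    // workerCount is a hypothetical configuration value; 1 reproduces the
    // single-thread behavior described in this issue.
    NotificationWorkersSketch(int workerCount) {
        if (workerCount < 1) {
            throw new IllegalArgumentException("workerCount must be >= 1");
        }
        this.workers = Executors.newFixedThreadPool(workerCount);
    }

    // Queues one notification. With more than one worker, a notification
    // stuck on a slow server occupies only one thread while the others
    // keep draining the queue.
    Future<?> enqueue(Runnable notification) {
        return workers.submit(notification);
    }

    // Interrupts in-flight notifications and discards queued ones.
    void shutdown() {
        workers.shutdownNow();
    }
}
```

A pool only reduces the damage: if enough URLs stall at once, every worker blocks, so a per-call timeout is still needed.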

I will prepare a patch soon. Please comment back.

Attachments

MAPREDUCE-5066.2.patch, 01/Apr/13 02:13, 8 kB, Ivan Mitic
MAPREDUCE-5066.3.patch, 15/Apr/13 05:04, 17 kB, Ivan Mitic
MAPREDUCE-5066.branch-1-win.2.patch, 18/Mar/13 00:24, 16 kB, Ivan Mitic
MAPREDUCE-5066.branch-1-win.3.patch, 01/Apr/13 01:28, 16 kB, Ivan Mitic
MAPREDUCE-5066.branch-1-win.4.patch, 01/Apr/13 02:13, 17 kB, Ivan Mitic
MAPREDUCE-5066.branch-1-win.5.patch, 15/Apr/13 05:03, 17 kB, Ivan Mitic
MAPREDUCE-5066.branch-1-win.patch, 17/Mar/13 21:54, 16 kB, Ivan Mitic
MAPREDUCE-5066.patch, 01/Apr/13 01:28, 7 kB, Ivan Mitic

Issue Links

is duplicated by: MAPREDUCE-4935 Support timeout limitation to MRv1 job end notifications (Resolved)

People

Assignee: Ivan Mitic
Reporter: Ivan Mitic
Votes: 0
Watchers: 6

Dates

Created: 14/Mar/13 07:39
Updated: 22/Aug/13 02:50
Resolved: 20/Apr/13 19:25