[FALCON-1677] Support re-tries for timed-out instances - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: trunk, 0.9
Component/s: None
Labels:
None

Description

Currently, Falcon retries only on failure. We should extend support in case of timed-out instances too. Earlier, since we were relying on post-processing to notify the instance status, this was not possible. Now that Falcon relies on Oozie JMS notifications, we can support retries for timed out instances too.

If a dataset is expected to get delayed for a long time, the user is currently forced to supply a large timeout value. This is an overhead in terms of Oozie having to poll for that long. If we introduce retries, the timeout can be a reasonable value with periodic/exponential back-off retries.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

FALCON-1677-v0.patch
11/Jan/16 17:15
5 kB
Narayan Periwal
FALCON-1677-v1.patch
12/Jan/16 08:18
7 kB
Narayan Periwal
FALCON-1677-v2.patch
12/Jan/16 08:38
7 kB
Narayan Periwal
FALCON-1677-v3.patch
13/Jan/16 11:35
12 kB
Narayan Periwal

Issue Links

breaks

FALCON-2060 Retry does not happen if instance timedout

Resolved

links to

Review Board

Activity

People

Assignee:: Narayan Periwal

Reporter:: Pallavi Rao

Votes:: 0 Vote for this issue

Watchers:: 6 Start watching this issue

Dates

Created:: 21/Dec/15 05:38

Updated:: 08/Aug/16 09:38

Resolved:: 14/Jan/16 11:06