[PIG-4680] Enable pig job graphs to resume from last successful state - ASF JIRA

Details

Type: Improvement
Status: Open
Priority: Major
Resolution: Unresolved
Affects Version/s: None
Fix Version/s: None
Component/s: impl
Labels: None

Description

A Pig script can have multiple ETL jobs in its DAG, and these may take hours to finish. If a transient error occurs, the whole run fails. When the script is rerun, every node in the job graph runs again, even though some of those nodes had already completed successfully. These redundant runs waste cluster capacity and delay pipelines.

On failure, we can persist the state of the graph. On the next run, only the failed nodes and their successors are rerun. This is, of course, subject to preconditions such as:

Pig script has not changed
Input locations have not changed
Output data from previous run is intact
Configuration has not changed
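
The idea above can be sketched in a few lines. This is only an illustration of the proposal, not the code in the attached patches: the function names (`fingerprint`, `nodes_to_rerun`, `plan_resume`), the shape of the saved state, and the `outputs_intact` callback are all hypothetical. It also assumes that any node not recorded as succeeded (failed or never started) must run, together with everything downstream of it.

```python
import hashlib
import json
from collections import deque


def fingerprint(script_text, input_paths, config):
    """Hash the script, input locations and configuration (hypothetical helper).

    If any of the three changes, the fingerprint changes and a resume is unsafe.
    """
    payload = json.dumps(
        {"script": script_text, "inputs": sorted(input_paths), "config": config},
        sort_keys=True,
    )
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def all_nodes(successors):
    """Collect every node that appears as a key or as a successor."""
    nodes = set(successors)
    for children in successors.values():
        nodes.update(children)
    return nodes


def nodes_to_rerun(successors, succeeded):
    """Return every node that did not succeed, plus all of its descendants.

    `successors` maps a node to the nodes that consume its output.
    """
    rerun = all_nodes(successors) - set(succeeded)
    queue = deque(rerun)
    while queue:
        node = queue.popleft()
        for child in successors.get(node, ()):
            if child not in rerun:
                rerun.add(child)
                queue.append(child)
    return rerun


def plan_resume(saved_state, current_fingerprint, successors, outputs_intact):
    """Pick the nodes to run; fall back to a full run if a precondition fails."""
    if saved_state is None or saved_state["fingerprint"] != current_fingerprint:
        return all_nodes(successors)
    # A node only counts as done if its previous output is still there.
    succeeded = {n for n in saved_state["succeeded"] if outputs_intact(n)}
    return nodes_to_rerun(successors, succeeded)
```

For example, with a graph `load_a -> filter -> join`, `load_b -> join`, `join -> store`, a saved state in which `load_a`, `filter` and `load_b` succeeded, and all previous outputs intact, `plan_resume` selects only `join` and `store`. If the fingerprint differs, or no state was saved, it selects the whole graph.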

Attachments


PIG-4680.patch, 09/Oct/15 13:26, 43 kB, Abhishek Agarwal
patch_recover, 28/Mar/16 06:14, 74 kB, Prateek Vaishnav

Issue Links

links to

Review request

People

Assignee: Prateek Vaishnav

Reporter: Abhishek Agarwal

Votes: 1

Watchers: 5

Dates

Created: 17/Sep/15 12:17

Updated: 06/Apr/16 07:13