Uploaded image for project: 'Pig'
  1. Pig
  2. PIG-4680

Enable pig job graphs to resume from last successful state

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • impl
    • None

    Description

      Pig scripts can have multiple ETL jobs in the DAG which may take hours to finish. In case of transient errors, the job fails. When the job is rerun, all the nodes in Job graph will rerun. Some of these nodes may have already run successfully. Redundant runs lead to wastage of cluster capacity and pipeline delays.

      In case of failure, we can persist the graph state. In next run, only the failed nodes and their successors will rerun. This is of course subject to preconditions such as

      • Pig script has not changed
      • Input locations have not changed
      • Output data from previous run is intact
      • Configuration has not changed

      Attachments

        1. PIG-4680.patch
          43 kB
          Abhishek Agarwal
        2. patch_recover
          74 kB
          Prateek Vaishnav

        Issue Links

          Activity

            People

              Pratyy Prateek Vaishnav
              abhishek.agarwal Abhishek Agarwal
              Votes:
              1 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated: