Uploaded image for project: 'Apache Tez'
  1. Apache Tez
  2. TEZ-3334 Tez Custom Shuffle Handler
  3. TEZ-3363

Delete intermediate data at the vertex level for Shuffle Handler

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 0.10.2
    • None
    • None

    Description

      For applications like pig where processing times can be very long, applications may choose to delete intermediate data for a sub dag. For example if a DAG has synced data to HDFS, all upstream intermediate data can be safely deleted.

      Attachments

        1. TEZ-3363.001.patch
          78 kB
          Kuhu Shukla
        2. TEZ-3363.002.patch
          78 kB
          Kuhu Shukla
        3. TEZ-3363.03.patch
          82 kB
          Syed Shameerur Rahman

        Issue Links

          Activity

            People

              srahman Syed Shameerur Rahman
              jeagles Jonathan Turner Eagles
              Votes:
              0 Vote for this issue
              Watchers:
              10 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 11.5h
                  11.5h