Uploaded image for project: 'Apache Tez'
  1. Apache Tez
  2. TEZ-2599

Dont send obsoleted data movement events to tasks

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 0.8.0-alpha
    • None
    • None
    • Reviewed

    Description

      Since events are sent in bulk and in sequential order to tasks they can end up getting a bunch of data movement events in round 1 and then an input failed event in round 2. The task may end up scheduling fetches for these obsoleted events before round 2, leading to wastage.
      Given that an input failed event and data movement event can be matched via their source task attempt ids, the AM can use this match to obsolete the data movements events ahead of time and not send them to the tasks in the first place.
      The input failed events still need to be sent to the tasks, so that they can obsolete any data movements events that they may have received much earlier from the failed task attempt.

      Attachments

        1. TEZ-2599.2.patch
          15 kB
          Bikas Saha
        2. TEZ-2599.1.patch
          14 kB
          Bikas Saha

        Issue Links

          Activity

            People

              bikassaha Bikas Saha
              bikassaha Bikas Saha
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: