Uploaded image for project: 'Apache Tez'
  1. Apache Tez
  2. TEZ-2612 Critical path analyzer for DAGs
  3. TEZ-2646

Add scheduling casual dependency for attempts

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 0.8.0-alpha
    • None
    • None
    • Reviewed

    Description

      When a task gets scheduled then we dont know what caused it. Some possibilities are
      1) initial scheduling by the vertex manager - causality determined by VM. E.g. dynamic partition pruning VM in Hive can point causality to the attempt that sent it the stats needed to complete the partition pruning logic.
      2) re-scheduling due to own previous version failure - causality points to the previous version that just failed
      3) re-scheduling because read error reported by consumer - causality points to the consumer attempt that reported the error and caused the scheduling.

      This causality relationship can be used to stitch together scheduling dependencies in the execution timeline of the DAG.

      Attachments

        1. TEZ-2646.1.patch
          40 kB
          Bikas Saha

        Activity

          People

            bikassaha Bikas Saha
            bikassaha Bikas Saha
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: