Apache Tez / TEZ-2612

Critical path analyzer for DAGs


Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved

    Description

      This analyzer plugin/tool helps identify the vertices and tasks that matter most in a large DAG, for performance analysis and bottleneck finding. It can be used to find: 1. input dependency, 2. failure dependency, 3. scheduling dependency (possibly at a later stage).
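As an illustration of the input-dependency case, here is a minimal sketch of a vertex-level critical path walk. It assumes each vertex carries a start time, a finish time, and the names of its input vertices. The `Vertex` class, its fields, and the sample DAG are hypothetical and do not correspond to Tez or ATS data structures.

```python
from dataclasses import dataclass, field


@dataclass
class Vertex:
    # Hypothetical record; not a Tez or ATS type.
    name: str
    start: float
    finish: float
    inputs: list = field(default_factory=list)  # names of upstream vertices


def critical_path(vertices):
    """Walk back from the last-finishing vertex, at each step following
    the input vertex that finished last (the one the consumer waited on)."""
    by_name = {v.name: v for v in vertices}
    current = max(vertices, key=lambda v: v.finish)
    path = [current.name]
    while current.inputs:
        current = max((by_name[n] for n in current.inputs),
                      key=lambda v: v.finish)
        path.append(current.name)
    path.reverse()  # report the path from source to sink
    return path


# Made-up timings, for illustration only.
dag = [
    Vertex("Map1", 0, 40),
    Vertex("Map2", 0, 95),
    Vertex("Join", 10, 130, inputs=["Map1", "Map2"]),
    Vertex("Reduce", 100, 180, inputs=["Join"]),
]

print(critical_path(dag))
```

With these sample timings the walk goes through Map2 rather than Map1, because Map2 is the input that Join had to wait for. Failure and scheduling dependencies would need extra edges that this sketch does not model.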

      Creating this as an umbrella ticket. Getting this detail at vertex level may be possible with the existing logs derived from ATS. Task-level analysis requires some additional details:

      1. Timeline details, such as when fetch/merge/compute/sort happen, are not captured now. These details could possibly be added to TaskCompletionEvent.
      2. Additional details are needed, such as the last event that completed processing in the input (for tracing at task level).
      3. Record the downstream task attempt that caused the higher-level (upstream) task to get rescheduled/restarted. This helps in understanding cases where a task failed due to a read error.
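The three items above could be captured in a per-attempt record along the following lines. This is only a sketch of what such data might look like: the `AttemptRecord` class, its field names, and the attempt ids are hypothetical, and none of these fields exist in Tez's TaskCompletionEvent today.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class AttemptRecord:
    # Hypothetical fields; none of these are existing Tez fields.
    attempt_id: str
    phases: dict                                   # item 1: phase -> (start, end)
    last_input_event_time: Optional[float] = None  # item 2
    rescheduled_by: Optional[str] = None           # item 3: downstream attempt id


def dominant_phase(rec):
    """Return the phase (fetch/merge/compute/sort ...) that took longest."""
    return max(rec.phases, key=lambda p: rec.phases[p][1] - rec.phases[p][0])


def failure_chain(records, attempt_id):
    """Follow rescheduled_by links from an attempt to the downstream
    attempts that triggered its rerun. Stops on unknown ids or cycles."""
    by_id = {r.attempt_id: r for r in records}
    chain, seen = [attempt_id], {attempt_id}
    cur = by_id[attempt_id].rescheduled_by
    while cur is not None and cur not in seen:
        chain.append(cur)
        seen.add(cur)
        cur = by_id[cur].rescheduled_by if cur in by_id else None
    return chain


# Made-up sample: a map attempt rerun after a reducer reported a read error.
records = [
    AttemptRecord("map_0_attempt_1",
                  {"compute": (0, 20), "sort": (20, 28)},
                  rescheduled_by="reduce_3_attempt_0"),
    AttemptRecord("reduce_3_attempt_0",
                  {"fetch": (0, 30), "merge": (30, 35), "compute": (35, 50)},
                  last_input_event_time=30.0),
]

print(dominant_phase(records[1]))
print(failure_chain(records, "map_0_attempt_1"))
```

Keeping the phase timings as a plain mapping keeps the record open to new phases. The `rescheduled_by` link is what would let an analyzer attribute a rerun to the downstream read error that caused it.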

          People

            bikassaha Bikas Saha
            rajesh.balamohan Rajesh Balamohan
