Uploaded image for project: 'Apache Tez'
  1. Apache Tez
  2. TEZ-1829

Formalize per Input/Output counters

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • None
    • None

    Description

      Per Input / Output counters provide a much better view of how different Inputs / Outputs in a task executed. This isn't enabled by default since there's some work left.
      Putting down some points which need to be figured out to get these enabled by default.

      • How are the counters represented. What we have today is GROUP_NAMES changing to reflect the Input/Output. The counter names under this remains unchanged.
      • How will the counters be aggregated - Is there any point in an aggregate view of BYTES_READ across all Inputs, instead of being aggregated only on the basis of the specific Input.
      • Access from the client. Do we want to expose an interface to access Input / Output parameters. Currently this is accessed at a task level, and users would have to know the Input/Output name to figure out which counter to fetch programatically. We can leave this as is, and at a later point enhance the getCounters API with information for the IOs.

      Attachments

        Activity

          People

            Unassigned Unassigned
            sseth Siddharth Seth
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated: