SPARK-34488 (project: Spark; sub-task of SPARK-23206 Additional Memory Tuning Metrics)

Support task Metrics Distributions and executor Metrics Distributions in the REST API call for a specified stage

Details

    • Type: Sub-task
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 3.0.2
    • Fix Version/s: 3.2.0
    • Component/s: Spark Core
    • Labels: None

    Description

      For a specific stage, it is useful to show the task metrics as a percentile distribution. This information helps users see whether there is skew or a bottleneck among the tasks in a given stage. An example is given in taskMetricsDistributions.json.

      Similarly, it is useful to show the executor metrics as a percentile distribution for a specific stage. This information shows whether the load is skewed toward some executors. An example is given in executorMetricsDistributions.json.
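
      To illustrate why a percentile distribution exposes skew, here is a minimal sketch using made-up task durations. The function name quantile_summary, the sample values, and the simple index-based selection rule are assumptions for illustration only; this is not necessarily how Spark computes its distributions.

```python
# Hypothetical helper: summarize a list of metric values at the given quantiles.
# The index-based selection below is an assumed, simplified rule.
def quantile_summary(values, quantiles=(0.0, 0.25, 0.5, 0.75, 1.0)):
    ordered = sorted(values)
    n = len(ordered)
    if n == 0:
        return []
    # Pick the element at position q * n, clamped to the last index.
    return [ordered[min(int(q * n), n - 1)] for q in quantiles]


# Made-up task durations in milliseconds; one task is far slower than the rest.
durations_ms = [100, 110, 105, 95, 120, 115, 98, 2400]
summary = quantile_summary(durations_ms)
print(summary)
```

      In this sample the value at quantile 1.0 is far larger than the value at quantile 0.75, which is the kind of gap that points to a straggler task.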

       

      We define the withSummaries query parameter in the REST API for a specific stage as follows:

      applications/<application_id>/<application_attempt>/stages/<stage_id>/<stage_attempt>?withSummaries=[true|false]&quantiles=0.0,0.25,0.5,0.75,1.0

      • When withSummaries=true, the REST API output includes both the task metrics distribution and the executor metrics distribution, each expressed in percentiles. The default value of withSummaries is false, i.e. no percentile distributions are included in the REST API output.
      • The quantiles query parameter defines the quantiles used to calculate the metrics distributions. It takes effect only when withSummaries=true. Its default value is 0.0,0.25,0.5,0.75,1.0.
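
      As a rough sketch of how a client might build and call this endpoint, consider the following. The helper names build_stage_url and fetch_stage, the base URL http://localhost:4040/api/v1, and the application and stage identifiers are all assumptions for illustration; adjust them to your deployment.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Default quantiles as stated in the issue description.
DEFAULT_QUANTILES = (0.0, 0.25, 0.5, 0.75, 1.0)


def build_stage_url(base_url, app_id, app_attempt, stage_id, stage_attempt,
                    with_summaries=False, quantiles=DEFAULT_QUANTILES):
    """Hypothetical helper: build the stage URL with the proposed query parameters."""
    path = (f"{base_url.rstrip('/')}/applications/{app_id}/{app_attempt}"
            f"/stages/{stage_id}/{stage_attempt}")
    params = {"withSummaries": "true" if with_summaries else "false"}
    # quantiles only takes effect when withSummaries=true, so send it only then.
    if with_summaries:
        params["quantiles"] = ",".join(str(q) for q in quantiles)
    # Keep commas unescaped so the URL matches the form shown in the description.
    return f"{path}?{urlencode(params, safe=',')}"


def fetch_stage(url):
    """Hypothetical helper: GET the URL and decode the JSON response body."""
    with urlopen(url) as resp:
        return json.load(resp)


# Assumed base URL and identifiers, for illustration only.
url = build_stage_url("http://localhost:4040/api/v1", "app-1", "1", 3, 0,
                      with_summaries=True)
print(url)
```

      Calling fetch_stage(url) against a running Spark UI or history server would return the stage data as parsed JSON; it is not invoked here because it needs a live server.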

       

          People

            Assignee: angerszhu (angerszhuuu)
            Reporter: Ron Hu (ron8hu)