Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-45084

ProgressReport should include an accurate effective shuffle partition number

    XMLWordPrintableJSON

Details

    Description

      Currently, there is a numShufflePartitions "metric" reported inĀ 
      StateOperatorProgress part of the progress report. However, the number is reported by aggregating executors so in the case of task retry or speculative executor, the metric is higher than number of shuffle partitions for the query plan. Number of shuffle partitions can be useful for reporting purpose so having a metric is helpful.

      Attachments

        Activity

          People

            siying Siying Dong
            siying Siying Dong
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: