Details
- Type: Bug
- Status: Resolved
- Priority: Major
- Resolution: Incomplete
Description
In ExternalSorter's spillToMergeableFile() method, we pass ShuffleWriteMetrics instances to the disk writers but discard the shuffleWriteTime metrics captured there. I think we should account for this I/O time, possibly by introducing new metrics that distinguish time spent writing spills from time spent writing final shuffle output, and by extending the UI to break down the overall I/O write time into these two components.
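To make the proposal concrete, here is a minimal sketch of the accounting described above. It is not Spark's code: `WriteMetrics`, `TaskWriteTimes`, `timed_write`, `spill`, and `write_final_output` are hypothetical names invented for illustration, and it is written in Python only to keep it short and runnable. The idea is that each spill gets its own throwaway metrics object, as it does today, but the captured write time is folded into a task-level spill counter instead of being dropped.

```python
import time
from dataclasses import dataclass


@dataclass
class WriteMetrics:
    """Hypothetical per-writer metrics object (a stand-in, not Spark's ShuffleWriteMetrics)."""
    write_time_ns: int = 0

    def inc_write_time(self, ns: int) -> None:
        self.write_time_ns += ns


@dataclass
class TaskWriteTimes:
    """Hypothetical task-level totals, split into the two proposed components."""
    spill_write_time_ns: int = 0
    shuffle_write_time_ns: int = 0

    @property
    def total_write_time_ns(self) -> int:
        return self.spill_write_time_ns + self.shuffle_write_time_ns


def timed_write(metrics, write_fn, clock=time.perf_counter_ns):
    """Run write_fn and record its elapsed time on the given metrics object."""
    start = clock()
    try:
        write_fn()
    finally:
        metrics.inc_write_time(clock() - start)


def spill(task_times, write_fn, clock=time.perf_counter_ns):
    """Write one spill; keep the captured write time instead of discarding it."""
    spill_metrics = WriteMetrics()  # per-spill metrics, as in the description
    timed_write(spill_metrics, write_fn, clock)
    task_times.spill_write_time_ns += spill_metrics.write_time_ns


def write_final_output(task_times, write_fn, clock=time.perf_counter_ns):
    """Write the final shuffle output and record its time separately from spills."""
    final_metrics = WriteMetrics()
    timed_write(final_metrics, write_fn, clock)
    task_times.shuffle_write_time_ns += final_metrics.write_time_ns
```

With totals kept this way, a UI could report `total_write_time_ns` as before and also show how much of it came from spills versus final shuffle output.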
Issue Links
- duplicates: SPARK-3577 Add task metric to report spill time (Resolved)
- is part of: SPARK-7416 Shuffle performance metrics umbrella (Closed)