[SPARK-32470] Remove task result size check for shuffle map stage - ASF JIRA

Details

Type: Bug
Status: Resolved
Priority: Minor
Resolution: Fixed
Affects Version/s: 2.4.6, 3.0.0, 3.1.0
Fix Version/s: 3.1.0
Component/s: Spark Core
Labels: None

Description

The task result of a shuffle map stage is not the query result; it consists only of the map status and metrics accumulator updates. Apart from the metrics, which can vary in size, the total task result size depends solely on the number of tasks, and the number of tasks can grow large regardless of the stage's output size. For example, the number of tasks generated by `CartesianProduct` is the square of "spark.sql.shuffle.partitions": with "spark.sql.shuffle.partitions" set to 200 you get 40,000 tasks, and with 500 you get 250,000 tasks. This can easily exceed the default limit of `spark.driver.maxResultSize` and fail with:

org.apache.spark.SparkException: Job aborted due to stage failure: Total size of serialized results of 66496 tasks (4.0 GiB) is bigger than spark.driver.maxResultSize (4.0 GiB)
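To make the arithmetic concrete, here is a rough Python sketch. It assumes the squared-partition rule stated above and takes the figures from the error message at face value (the logged "4.0 GiB" is rounded, so the per-task number is only approximate). The function names are hypothetical and not part of Spark.

```python
GIB = 1024 ** 3  # bytes in one GiB


def cartesian_task_count(shuffle_partitions: int) -> int:
    """Task count under the rule stated in this issue: partitions squared."""
    return shuffle_partitions ** 2


def avg_result_bytes(total_bytes: int, num_tasks: int) -> float:
    """Average serialized result size per task, in bytes."""
    return total_bytes / num_tasks


print(cartesian_task_count(200))  # 40000
print(cartesian_task_count(500))  # 250000

# Figures from the error message: 66496 tasks, about 4.0 GiB in total.
per_task = avg_result_bytes(4 * GIB, 66496)
print(f"{per_task / 1024:.1f} KiB per task")  # roughly 63 KiB
```

At a few tens of KiB per task, the limit is reached purely because of the task count, independent of how much data the stage actually outputs.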

However, the driver uses map status and accumulator updates to update the overall map statistics and metrics of the query and does not cache them, so they will not cause catastrophic memory issues on the driver. The check should therefore be removed for shuffle map stage tasks.
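The proposed behavior can be modeled with a small Python sketch. This is a simplified illustration, not Spark's scheduler code: `ResultSizeTracker`, `can_fetch_more_results`, and the field names are hypothetical, and the actual change is in the linked pull request.

```python
from dataclasses import dataclass


@dataclass
class ResultSizeTracker:
    """Hypothetical model of a per-stage result-size check."""
    max_result_size: int          # bytes; 0 is treated as "no limit"
    is_shuffle_map_stage: bool    # True when task results are map status + metrics
    total_result_size: int = 0    # running total of serialized result bytes

    def can_fetch_more_results(self, size: int) -> bool:
        self.total_result_size += size
        # Proposed behavior: shuffle map stage results are exempt from the limit.
        if self.is_shuffle_map_stage:
            return True
        # Result stages still fail once the running total exceeds the limit.
        if 0 < self.max_result_size < self.total_result_size:
            return False
        return True


result_stage = ResultSizeTracker(max_result_size=100, is_shuffle_map_stage=False)
print(result_stage.can_fetch_more_results(60))  # True  (60 <= 100)
print(result_stage.can_fetch_more_results(60))  # False (120 > 100)

shuffle_stage = ResultSizeTracker(max_result_size=100, is_shuffle_map_stage=True)
print(shuffle_stage.can_fetch_more_results(60))  # True
print(shuffle_stage.can_fetch_more_results(60))  # True (check skipped)
```

The sketch keeps accumulating the total even for shuffle map stages, so the size remains observable; only the abort decision is skipped.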

Issue Links

relates to: SPARK-36071 Spark driver requires large memory space for serialized results even there are no data collected to the driver (Resolved)
links to: [Github] Pull Request #29276 (maryannxue)

People

Assignee: Wei Xue
Reporter: Wei Xue
Votes: 0
Watchers: 4

Dates

Created: 28/Jul/20 18:04
Updated: 15/Sep/21 02:04
Resolved: 11/Aug/20 06:31