Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Won't Do
-
3.0.0
-
None
-
None
Description
For fixing the correctness bug of SPARK-28699, we disable radix sortĀ for the scenario of repartition in Spark SQL. This will cause a performance regression.
So for limiting the performance overhead, we'll do the optimizing work by only enable sort for the repartition operation while stage retries happening. This work depends on SPARK-25341.
Attachments
Issue Links
- is blocked by
-
SPARK-25341 Support rolling back a shuffle map stage and re-generate the shuffle files
- Resolved
- is caused by
-
SPARK-28699 Cache an indeterminate RDD could lead to incorrect result while stage rerun
- Resolved