Details
-
Improvement
-
Status: In Progress
-
Major
-
Resolution: Unresolved
-
3.1.0
-
None
-
None
Description
This is a follow up of https://issues.apache.org/jira/browse/SPARK-23243
To completely fix that problem, Spark needs to be able to rollback a result stage and rerun all the result tasks.
However, the result stage may do file committing, which does not support re-commit a task currently. We should either support to rollback a committed task, or abort the entire committing and do it again.
Attachments
Issue Links
- relates to
-
SPARK-23243 Shuffle+Repartition on an RDD could lead to incorrect answers
- Resolved
-
SPARK-25341 Support rolling back a shuffle map stage and re-generate the shuffle files
- Resolved
- links to