Details
-
Improvement
-
Status: Resolved
-
Minor
-
Resolution: Fixed
-
None
-
None
Description
SPARK-17644 opened up a discussion about further refactoring of the DAGScheduler's handling of FetchFailure events. These include:
- rewriting code and comments to improve readability
- doing fetchFailedAttemptIds.add(stageAttemptId) even when disallowStageRetryForTest is true
- issuing a ResubmitFailedStages event based on whether one is already enqueued for the current failed stage, not any prior failed stage
- logging the resubmission of all failed stages