Details
-
Improvement
-
Status: Closed
-
Critical
-
Resolution: Fixed
-
2.1.0-beta
-
None
-
None
-
Reviewed
-
Description
There are chances MRAppMaster crush during job committing,or NodeManager restart cause the committing AM exit due to container expire.In these cases ,the job will fail.
However,some jobs can redo commit so failing the job becomes unnecessary.
Let clients tell AM to allow redo commit or not is a better choice.
This idea comes from Jason Lowe's comments in MAPREDUCE-4819
Attachments
Attachments
Issue Links
- breaks
-
MAPREDUCE-6555 TestMRAppMaster fails on trunk
- Resolved
-
MAPREDUCE-6595 Fix findbugs warnings in OutputCommitter and FileOutputCommitter
- Resolved
- is depended upon by
-
MAPREDUCE-6608 Work Preserving AM Restart for MapReduce
- Open
- is duplicated by
-
MAPREDUCE-6437 Add retry on some connection exception on job commit phase
- Resolved
- is related to
-
MAPREDUCE-6545 Test committer.commitJob() behavior during committing when MR AM get failed.
- Open
- relates to
-
MAPREDUCE-6478 Add an option to skip cleanupJob stage or ignore cleanup failure during commitJob().
- Resolved
-
MAPREDUCE-4815 Speed up FileOutputCommitter#commitJob for many output files
- Closed