Details
-
Improvement
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
-
None
-
None
Description
Consider a case where attempts for the final stage in a long DAG fails due to out of memory. In such a scenario, the framework ( or via the base vertex manager ) should be able to change the task specifications on the fly to trigger a re-run with modified specs.
Changes could be both java opts changes as well as container resource requirements.
Attachments
Attachments
Issue Links
- depends upon
-
YARN-2091 Add more values to ContainerExitStatus and pass it from NM to RM and then to app masters
- Closed