Details
-
Improvement
-
Status: Open
-
Major
-
Resolution: Unresolved
-
2.2.0
-
None
-
None
Description
Currently, the JHS has no information about jobs whose AMs have failed. This is because the History is written by the AM to the intermediate folder just before finishing, so when it fails for any reason, this information isn't copied there. However, it is not lost as its in the AM's staging directory. To make the History available in the JHS, all we need to do is have another mechanism to move the History from the staging directory to the intermediate directory. The AM also writes a "Summary" file before exiting normally, which is also unavailable when the AM fails.
Attachments
Attachments
Issue Links
- depends upon
-
YARN-1731 ResourceManager should record killed ApplicationMasters for History
- Resolved
- is duplicated by
-
MAPREDUCE-5418 JobHistoryServer has no information about applications if the MR-AM crashes
- Resolved