Hadoop Map/Reduce: MAPREDUCE-5641

History for failed Application Masters should be made available to the Job History Server

    Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 2.2.0
    • Fix Version/s: None
    • Labels: None

      Description

      Currently, the JHS has no information about jobs whose AMs have failed. This is because the History is written by the AM to the intermediate folder just before finishing, so when the AM fails for any reason, this information isn't copied there. However, it is not lost, as it's in the AM's staging directory. To make the History available in the JHS, all we need is another mechanism to move the History from the staging directory to the intermediate directory. The AM also writes a "Summary" file before exiting normally, which is likewise unavailable when the AM fails.
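
      As a rough illustration of the kind of move described above (not taken from the attached patches), the copy itself only needs the standard FileSystem API. The paths below are examples; the real locations come from yarn.app.mapreduce.am.staging-dir and mapreduce.jobhistory.intermediate-done-dir.

        // Hypothetical sketch only: copy a .jhist file from the AM's staging
        // directory to the intermediate done directory so the JHS can pick it up.
        // The paths and file names are illustrative, not the patch's actual layout.
        import org.apache.hadoop.conf.Configuration;
        import org.apache.hadoop.fs.FileSystem;
        import org.apache.hadoop.fs.FileUtil;
        import org.apache.hadoop.fs.Path;

        public class MoveHistoryToIntermediate {
          public static void main(String[] args) throws Exception {
            Configuration conf = new Configuration();
            FileSystem fs = FileSystem.get(conf);
            Path staged = new Path(
                "/user/alice/.staging/job_1385000000000_0042/job_1385000000000_0042_1.jhist");
            Path intermediate = new Path(
                "/mr-history/intermediate/alice/job_1385000000000_0042.jhist");
            // deleteSource=true makes the copy behave like a move once it succeeds.
            FileUtil.copy(fs, staged, fs, intermediate, true, conf);
          }
        }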

      Attachments

      1. MAPREDUCE-5641.patch (27 kB, Robert Kanter)
      2. MAPREDUCE-5641.patch (27 kB, Robert Kanter)


          Activity

          Vinod Kumar Vavilapalli added a comment (edited)

          Unless I am missing something, I still don't understand why my proposal here, of making the JHS talk to the RM about the application information, is not enough to begin with. It can be extended in the future to talk to the AHS to obtain more information.

          To your question about scale, Jason did answer that it can be done on demand for only those apps which don't have history files.

          Karthik Kambatla added a comment

          Thinking more about this, I am slightly wary of using AHSClient or the store directly for this, before we iron out any rough edges and mark them stable.

          Vinod Kumar Vavilapalli, Zhijie Shen - do you think it is reasonable to let this go through for now, even though it is not the cleanest approach and adds duplicate code? Once the AHS is stable, we can follow up by removing the flag-file parts in YARN and updating the JHS parts of the code to use the AHS instead of the flag file.

          Zhijie Shen added a comment

          So it sounds like instead of doing YARN-1731 to make the RM write a little flag file that the JHS can check for, we can have the JHS check this store just like the AHS is doing. That should be cleaner.

          It could be an option, but it depends on what information you want. According to my previous understanding, you plan to inspect the jhist file and probably look for MR-specific information, such as map, reduce, shuffle, and merge details. That cannot be obtained from the AHS. In contrast, other generic information, such as start time, finish time, and host, can be obtained from the AHS. Perhaps you can choose to recover part of the information for a failed MR AM now, and make a complete recovery once MR reports its framework-specific information to the timeline service.

          What is the store that it's using? And where can I find out more about it or its API so I can update this patch to use it?

          The suggested way to access the information is not to read from the store directly, but to use AHSClient or the web services, assuming you are going to do this programmatically.
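
          For illustration only (not part of this patch), programmatic access through AHSClient would look roughly like the sketch below; it returns only the generic fields mentioned above, and the class and calls are a sketch, not the JHS code.

            import org.apache.hadoop.yarn.api.records.ApplicationId;
            import org.apache.hadoop.yarn.api.records.ApplicationReport;
            import org.apache.hadoop.yarn.client.api.AHSClient;
            import org.apache.hadoop.yarn.conf.YarnConfiguration;
            import org.apache.hadoop.yarn.util.ConverterUtils;

            public class AhsLookup {
              public static void main(String[] args) throws Exception {
                AHSClient ahs = AHSClient.createAHSClient();
                ahs.init(new YarnConfiguration());
                ahs.start();
                try {
                  // e.g. application_1385000000000_0042
                  ApplicationId appId = ConverterUtils.toApplicationId(args[0]);
                  ApplicationReport report = ahs.getApplicationReport(appId);
                  // Only generic information is available here: no MR counters,
                  // no map/reduce/shuffle details.
                  System.out.println(report.getName()
                      + " started=" + report.getStartTime()
                      + " finished=" + report.getFinishTime()
                      + " host=" + report.getHost()
                      + " finalStatus=" + report.getFinalApplicationStatus());
                } finally {
                  ahs.stop();
                }
              }
            }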

          Robert Kanter added a comment

          So it sounds like instead of doing YARN-1731 to make the RM write a little flag file that the JHS can check for, we can have the JHS check this store just like the AHS is doing. That should be cleaner.

          What is the store that it's using? And where can I find out more about it or its API so I can update this patch to use it?

          Zhijie Shen added a comment

          Ah, sorry, I said the wrong word. It should be finished, failed, or killed. If the AM crashes and there are no more retries, the application will be failed, right? The AHS records the information from the RM's point of view.

          Please excuse my ignorance about AHS. What is the source of applications for the AHS? Does it periodically poll the RM? Or, does the RM trigger something on the completion of an app or its attempts?

          The AHS doesn't query the RM. Instead, the RM pushes the information to a store that the AHS can read. The information is pushed as events before the application life cycle completes, no matter whether it ends up finished, failed, or killed.

          Karthik Kambatla added a comment

          This JIRA is not aimed at applications that have finished or been removed or killed. I guess the issue here is those AMs that crash - so the AMs don't leave any information about their existence. In that case, the JHS wouldn't know about them and hence won't show them.

          Please excuse my ignorance about AHS. What is the source of applications for the AHS? Does it periodically poll the RM? Or, does the RM trigger something on the completion of an app or its attempts?

          Zhijie Shen added a comment

          could you point us to how the AHS gets this information for AMs that crash. We might be able to re-use some of that if the RM side of things for doing this is stable.

          Whether an application is finished, removed, or killed, it is supposed to be recorded by the AHS. However, it depends on what you need. If you're looking for the generic information, the AHS should meet your requirement. Otherwise, you still need a workaround until MR's per-framework information can be recorded.

          Jason Lowe added a comment

          I originally thought that as well but then wondered if the query was to be lazily performed. It would query the RM when asked for a particular job for which it could not find the jhist in either done or done_intermediate. That would solve the issue for providing a specific job's history but not the use-case of browsing for it.

          Karthik Kambatla added a comment

          Vinod Kumar Vavilapalli - could you point us to how the AHS gets this information for AMs that crash. We might be able to re-use some of that if the RM side of things for doing this is stable.

          Karthik Kambatla added a comment

          Instead of adding new functionality, can the JHS simply ask the RM about the application status? Why would that not work?

          That would work, but on a cluster with say 10,000 running apps, the JHS would query the status of each app or fetch all apps every so often. It is nicer to avoid the poll model, no?

          Vinod Kumar Vavilapalli added a comment

          Vinod Kumar Vavilapalli, I'm a bit reluctant to get the JHS to depend on the AHS at this point as the AHS is not fully cooked. I would prefer dropping the JHS altogether in favor of the AHS when the AHS is ready for prime time with AM extensions.

          The problem is that as I understand it, this JIRA requires corresponding changes in YARN via YARN-1731. It doesn't make sense to add duplicate functionality in YARN.

          Instead of adding new functionality, can the JHS simply ask the RM about the application status? Why would that not work? Clearly, if the RM goes down and comes back up, it may lose history, but for that you need to enable the state store anyway. Otherwise, it should work for the most part. Thoughts?

          Robert Kanter added a comment

          The new patch has the JHS proxy as the user to do the copying, though you need to add the user running the JHS as a proxy user in core-site.xml. The patch also fixes a few bugs I found while testing. I've verified that it works correctly on a Kerberized cluster. I've also updated the corresponding patch in YARN-1731.
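
          For reference, the core-site.xml change being described is the standard proxy-user setting; the user name "mapred" and the host below are assumptions about the deployment, not values taken from the patch.

            <!-- Allow the JHS daemon user (assumed to be "mapred") to impersonate
                 job owners. Narrow the hosts/groups values to fit the cluster;
                 "*" is shown only for brevity. -->
            <property>
              <name>hadoop.proxyuser.mapred.hosts</name>
              <value>historyserver.example.com</value>
            </property>
            <property>
              <name>hadoop.proxyuser.mapred.groups</name>
              <value>*</value>
            </property>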

          Alejandro Abdelnur added a comment

          Yep, I meant that. The JHS is trusted code; no user code runs there. The doAs with the proxy user would be used only for this case. Also, all of this would go away when the AHS is ready to take over.
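
          A rough sketch of that doAs pattern, assuming the proxy-user configuration shown earlier is in place (a hypothetical helper, not code from the patch):

            import java.security.PrivilegedExceptionAction;
            import org.apache.hadoop.conf.Configuration;
            import org.apache.hadoop.fs.FileStatus;
            import org.apache.hadoop.fs.FileSystem;
            import org.apache.hadoop.fs.Path;
            import org.apache.hadoop.security.UserGroupInformation;

            public class ProxyUserRead {
              // Lists a job's staging directory while impersonating the job owner.
              static FileStatus[] listStagingAsUser(final String jobOwner,
                  final Path stagingDir, final Configuration conf) throws Exception {
                UserGroupInformation proxyUgi = UserGroupInformation.createProxyUser(
                    jobOwner, UserGroupInformation.getLoginUser());
                // Everything inside doAs runs with the job owner's HDFS identity,
                // provided core-site.xml authorizes the JHS user as a proxy user.
                return proxyUgi.doAs(new PrivilegedExceptionAction<FileStatus[]>() {
                  @Override
                  public FileStatus[] run() throws Exception {
                    FileSystem fs = FileSystem.get(conf);
                    return fs.listStatus(stagingDir);
                  }
                });
              }
            }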

          Jason Lowe added a comment

          In theory you could make the JHS able to proxy as users in HDFS so it can read the necessary files in the staging directory, if that's what you intended to suggest. Not sure I'm thrilled with the JHS having the ability to do anything in HDFS, but it should work.

          Jason Lowe added a comment

          how about not touching the current permissions of staging and making the RM a proxy user in HDFS? Then the files would be written as the user.

          The issue is not the permissions of the proposed file the RM would write, but rather the permissions of the .jhist and job.xml files written by the job. Those are already owned by the user, and the RM isn't involved at all. The issue with the originally proposed approach is that the JHS is not the user and therefore cannot access the necessary files to place them in the proper locations after the job completes (something the AM normally does).

          Alejandro Abdelnur added a comment

          Robert Kanter, Jason Lowe, how about not touching the current permissions of staging and making the RM a proxy user in HDFS? Then the files would be written as the user.

          Vinod Kumar Vavilapalli, I'm a bit reluctant to get the JHS to depend on the AHS at this point as the AHS is not fully cooked. I would prefer dropping the JHS altogether in favor of the AHS when the AHS is ready for prime time with AM extensions.

          Vinod Kumar Vavilapalli added a comment

          Haven't yet read the discussion, but overall, we don't need yet another solution for this. YARN-321 is already enabling generic history and so has a record of killed/failed applications. If we need a fix at all:

          • For the short term, we should make JHS invoke web-services on RM and/or AHS to obtain this information.
          • Medium/longer term, the generic data and timeline data (YARN-1530) will merge to expose all information about apps via web-services. And JHS (if it still exists by that time) should just use them.
          Jason Lowe added a comment

          Do you have any alternatives on how to allow the JHS to have access to those files?

          Outside of imposing new restrictions on where the staging directory can be and how it has to be configured, no I don't know of an easy way to do that. To allow the JHS to access these files, we'd minimally have to require the user directories in the staging area to have their group set to the "hadoop" group (see http://hadoop.apache.org/docs/r2.2.0/hadoop-project-dist/hadoop-common/ClusterSetup.html#Running_Hadoop_in_Secure_Mode for details on that group) and have permissions of 0750 all the way down to the specific staging directory for a job. Read permission is required so the history server can scan for the proper jhist file to grab, since a job with multiple AM attempts means the JHS can't just know what the name of the correct JHS file is – it would have to scan to see which is the latest. That would relax the permissions on a user's staging files to include the hadoop group. That's probably OK and far better than letting everyone in, but I haven't thought through all of the security ramifications of doing so.
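
          As a sketch of the scan mentioned above (illustrative only; it presumes the JHS can list the job's staging directory), picking the latest jhist among multiple AM attempts could look like:

            import org.apache.hadoop.fs.FileStatus;
            import org.apache.hadoop.fs.FileSystem;
            import org.apache.hadoop.fs.Path;
            import org.apache.hadoop.fs.PathFilter;

            public class LatestJhistFinder {
              // Returns the most recently modified *.jhist file in the job's
              // staging directory, or null if none is present.
              static Path findLatestJhist(FileSystem fs, Path jobStagingDir)
                  throws Exception {
                FileStatus[] candidates = fs.listStatus(jobStagingDir, new PathFilter() {
                  @Override
                  public boolean accept(Path p) {
                    return p.getName().endsWith(".jhist");
                  }
                });
                FileStatus latest = null;
                for (FileStatus s : candidates) {
                  if (latest == null
                      || s.getModificationTime() > latest.getModificationTime()) {
                    latest = s;
                  }
                }
                return latest == null ? null : latest.getPath();
              }
            }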

          Or to somehow get those files into the done_intermediate dir?

          A proper way to do this would be to have something run by the user of the job do it, as that doesn't require any additional security beyond what's already done today. However, that probably involves adding the ability in YARN to run a specified task when an application is failed/killed, to clean up after the unsuccessful run. It's a non-trivial task, but it would also help solve the problem we have today where staging directories are leaked for applications that are killed before the AM launches.

          Robert Kanter added a comment

          hmm... I hadn't thought about the security of those files. Do you have any alternatives on how to allow the JHS to have access to those files? Or to somehow get those files into the done_intermediate dir?

          Jason Lowe added a comment

          I should also point out that the staging directory itself may not be publicly accessible. The staging area is configurable, and our current setup places the staging area at /user. That puts each user's .staging directory under their home directory, and the home directory of most users is locked down to 700.

          Jason Lowe added a comment

          I don't believe that will work either, since the job history and job.xml files are 0600

          Sorry, this is incorrect – I was looking at the wrong files on one of our clusters. The job conf and jhist files are 644 by default, so it will work but insecurely.

          Jason Lowe added a comment

          I modified the permissions from 0700 to 0701.

          I don't believe that will work either, since the job history and job.xml files are 0600. So even if the history server can see the file via the execute bit, it won't be able to copy it. If we allow it to copy the file, then it's not secure: with those permissions, anyone who knows the job ID of an active job, the job's user, and the job's staging directory can obtain the job configuration (via job.xml) and job counters (via <jobid>_1.jhist). The information needed to pull this off is trivially available, as the first two are on the front page of the RM and the latter is in public cluster configs.

          Robert Kanter added a comment

          I’ve attached a preliminary version of the patch. Once we all agree on the specifics of the design, I can add unit tests.
          The patch follows the design I outlined before, where the RM writes a file when it sees an AM die, and the JHS sees that and copies the jhist and related files to the done_intermediate dir. I have tested this by running jobs and killing the AM. This results in incomplete information, as expected; however, in some cases some of the information won’t make 100% sense or is missing (e.g. no Finish Time if the AM didn’t actually finish). I’ve put in some code to take care of these situations. I’ve also attached a preliminary YARN patch to YARN-1731.

          How will the JHS copy the file to the intermediate directory? It likely won't have access to the staging directory containing the jhist file.

          I modified the permissions from 0700 to 0701.

          Jason Lowe added a comment

          How will the JHS copy the file to the intermediate directory? It likely won't have access to the staging directory containing the jhist file.

          Karthik Kambatla added a comment

          Proposal makes sense to me. Do you want to open a YARN JIRA for the YARN-specific changes?

          Robert Kanter added a comment

          I propose we solve this issue by doing the following:
          The Resource Manager is aware when the AM fails; when an AM fails, the RM can write a flag file to a new “fail” directory. The JHS periodically scans the "fail" dir for these flag files. When it sees one, it then looks for the History for that failed AM; if found, it copies/moves the History to the intermediate directory, where it will be processed by the JHS normally. If not found, it does nothing. Once done, the JHS can then delete the flag file.
          For the Summary file, most of it is static, so we can simply have the AM write that file out at startup (with 0 or "N/A" for dynamic fields) and then overwrite it at shutdown to get the values for the dynamic fields as it does now. If the AM fails, then the JHS will at least be able to pickup the first version of the Summary file.
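
          To make the proposed JHS side concrete, a hypothetical scanner might look like the following; the directory names, the flag-file naming scheme, and the staging-directory lookup are assumptions for illustration, not the attached patch.

            import org.apache.hadoop.conf.Configuration;
            import org.apache.hadoop.fs.FileStatus;
            import org.apache.hadoop.fs.FileSystem;
            import org.apache.hadoop.fs.FileUtil;
            import org.apache.hadoop.fs.Path;

            public class FailedAmHistoryScanner {
              private final FileSystem fs;
              private final Path failDir;          // e.g. /mr-history/fail
              private final Path intermediateDir;  // e.g. /mr-history/intermediate

              FailedAmHistoryScanner(Configuration conf, Path failDir,
                  Path intermediateDir) throws Exception {
                this.fs = FileSystem.get(conf);
                this.failDir = failDir;
                this.intermediateDir = intermediateDir;
              }

              // One pass over the "fail" directory: for each flag file the RM wrote,
              // try to move the job's history into the intermediate dir, then delete
              // the flag so it is not reprocessed.
              void scanOnce(Configuration conf) throws Exception {
                for (FileStatus flag : fs.listStatus(failDir)) {
                  String jobId = flag.getPath().getName();  // assume flag named after the job
                  Path staged = stagedHistoryFor(jobId);
                  if (staged != null && fs.exists(staged)) {
                    Path target = new Path(intermediateDir, staged.getName());
                    FileUtil.copy(fs, staged, fs, target, true, conf);
                  }
                  fs.delete(flag.getPath(), false);
                }
              }

              private Path stagedHistoryFor(String jobId) {
                // Placeholder: resolve the job's staging directory and pick its
                // latest .jhist file (see the earlier scan sketch); omitted here.
                return null;
              }
            }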


            People

            • Assignee: Robert Kanter
            • Reporter: Robert Kanter
            • Votes: 0
            • Watchers: 13
