[MAPREDUCE-4705] Historyserver links expire before the history data does - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Closed
Priority: Critical
Resolution: Fixed
Affects Version/s: 0.23.3
Fix Version/s: 2.0.3-alpha, 0.23.5
Component/s: jobhistoryserver, mrv2
Labels:
None

Target Version/s:

2.0.3-alpha, 0.23.5
Hadoop Flags:

Reviewed

Description

The historyserver can serve up links to jobs that become useless well before the job history files are purged. For example on a large, heavily used cluster we can end up rotating through the maximum number of jobs the historyserver can track fairly quickly. If a user was investigating an issue with a job using a saved historyserver URL, that URL can become useless because the historyserver has forgotten about the job even though the history files are still sitting in HDFS.

We can tell the historyserver to keep track of more jobs by increasing mapreduce.jobhistory.joblist.cache.size, but this has a direct impact on the responsiveness of the main historyserver page since it serves up all the entries to the client at once. It looks like Hadoop 1.x avoided this issue by encoding the history file location into the URLs served up by the historyserver, so it didn't have to track a mapping between job ID and history file location.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

MAPREDUCE-4705.patch
08/Oct/12 21:03
6 kB
Jason Darrell Lowe

Activity

People

Assignee:: Jason Darrell Lowe

Reporter:: Jason Darrell Lowe

Votes:: 0 Vote for this issue

Watchers:: 9 Start watching this issue

Dates

Created:: 04/Oct/12 21:32

Updated:: 06/Feb/13 17:05

Resolved:: 09/Oct/12 03:26