[MAPREDUCE-5268] Improve history server startup performance - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: 0.23.7, 2.0.4-alpha
Fix Version/s: 2.1.0-beta, 0.23.9
Component/s: jobhistoryserver
Labels:
None

Hadoop Flags:

Reviewed

Description

The history server can easily take many minutes to startup when there are a significant number of jobs to scan in the done directory. However the scanning of files is not the bottleneck, rather it's the heavy use of ConcurrentSkipListMap.size in HistoryFileManager.

ConcurrentSkipListMap.size is a very expensive operation, especially on maps with many entries, as it has to scan every entry to compute the size. We should avoid calling this method or at least minimize its use.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

mr-5268.patch
31/May/13 17:57
10 kB
Karthik Kambatla
mr-5268.patch
31/May/13 15:10
11 kB
Karthik Kambatla
mr-5268.patch
31/May/13 00:40
11 kB
Karthik Kambatla
mr-5268.patch
29/May/13 18:46
10 kB
Karthik Kambatla
mr-5268.patch
29/May/13 07:41
9 kB
Karthik Kambatla
mr-5268-prelim.patch
29/May/13 02:35
5 kB
Karthik Kambatla

Activity

People

Assignee:: Karthik Kambatla

Reporter:: Jason Darrell Lowe

Votes:: 0 Vote for this issue

Watchers:: 8 Start watching this issue

Dates

Created:: 22/May/13 20:12

Updated:: 03/Nov/14 18:05

Resolved:: 03/Jun/13 14:53