[HDFS-4879] Add "blocked ArrayList" collection to avoid CMS full GCs - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: 2.0.4-alpha, 3.0.0-alpha1
Fix Version/s: 2.3.0
Component/s: namenode
Labels:
None

Target Version/s:

Description

We recently saw an issue where a large deletion was issued which caused 25M blocks to be collected during deleteInternal. Currently, the list of collected blocks is an ArrayList, meaning that we had to allocate a contiguous 25M-entry array (~400MB). After a NN has been running for a long amount of time, the old generation may become fragmented such that it's hard to find a 400MB contiguous chunk of heap.

In general, we should try to design the NN such that the only large objects are long-lived and created at startup time. We can improve this particular case (and perhaps some others) by introducing a new List implementation which is made of a linked list of arrays, each of which is size-limited (eg to 1MB).

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

hdfs-4879.txt
27/Aug/13 23:05
16 kB
Todd Lipcon
hdfs-4879.txt
08/Jul/13 18:25
16 kB
Todd Lipcon
hdfs-4879.txt
04/Jun/13 21:15
49 kB
Todd Lipcon
hdfs-4879.txt
04/Jun/13 20:46
48 kB
Todd Lipcon

Activity

People

Assignee:: Todd Lipcon

Reporter:: Todd Lipcon

Votes:: 0 Vote for this issue

Watchers:: 19 Start watching this issue

Dates

Created:: 04/Jun/13 19:51

Updated:: 12/May/16 18:12

Resolved:: 06/Sep/13 19:06