Sanjay, thanks for your comments! I need to look more at
HDFS-2832, but I think we've got some nice overlap. In particular, I agree that cache would be just another DN Storage.
Block reports will indicate the storage type.
I'm okay with this, but our initial design proposes separate heartbeats, since cache reports might need to tick on a different interval. You might want frequent cache reports if datanodes are doing their own LRU eviction, but you'd want to adaptively throttle them back when the NN is under load, since cache report processing can be expensive.
Separate heartbeats per storage could definitely be added later, so consider this a later-stage optimization.
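To make the throttling idea concrete, here's a minimal sketch of an adaptive cache-report interval. The class name and the NN load hint piggybacked on the heartbeat response are assumptions for illustration, not existing DataNode code:

{code:java}
/**
 * Sketch only: back off cache reports when the NN signals load,
 * and recover quickly so DN-side LRU state stays fresh.
 */
public class CacheReportScheduler {
  private final long baseIntervalMs;  // quick reports for fresh LRU info
  private final long maxIntervalMs;   // ceiling when the NN is busy
  private long currentIntervalMs;

  public CacheReportScheduler(long baseIntervalMs, long maxIntervalMs) {
    this.baseIntervalMs = baseIntervalMs;
    this.maxIntervalMs = maxIntervalMs;
    this.currentIntervalMs = baseIntervalMs;
  }

  /** Hypothetical load hint carried on the heartbeat response. */
  public void onHeartbeatResponse(boolean nnUnderLoad) {
    if (nnUnderLoad) {
      // Exponential backoff: an overloaded NN sees fewer cache reports.
      currentIntervalMs = Math.min(currentIntervalMs * 2, maxIntervalMs);
    } else {
      // Recover immediately so cache state doesn't go stale.
      currentIntervalMs = baseIntervalMs;
    }
  }

  /** Delay before the next cache report. */
  public long nextDelayMs() {
    return currentIntervalMs;
  }
}
{code}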
NN will order the replica locations based on closeness and speed;
This is tricky because it depends on the network topology and workload. I don't want my single cached replica to get hammered by the entire cluster, but going to in-rack memory might still beat local disk. I figure clients should be able to provide a configurable ordering policy to their DFSClients.
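For concreteness, here's a minimal sketch of what a pluggable policy could look like on the client side. ReplicaLocation, LocationType, and ReplicaOrderingPolicy are made-up names for illustration, not existing DFSClient APIs:

{code:java}
import java.util.List;

// Illustrative types only; not existing HDFS classes.
enum LocationType { MEMORY, DISK }

class ReplicaLocation {
  final String datanode;    // e.g. "dn1/rack3"
  final LocationType type;  // cached in memory vs. on disk
  ReplicaLocation(String datanode, LocationType type) {
    this.datanode = datanode;
    this.type = type;
  }
}

/** Client-configurable hook for reordering the NN-provided locations. */
interface ReplicaOrderingPolicy {
  /** Reorder candidate locations in place, best-first for this client. */
  void order(List<ReplicaLocation> locations);
}
{code}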
I think we also still need the isCached flag for scheduling. Hypothetically, MR might always want to place tasks on a memory replica over a disk replica, so we could sort memory replicas first, then disk replicas. However, this squishes the existing topology-based ordering used by DFSClients, and all our DFSClients end up hammering the cached replica at read time.
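A sketch of that memory-first sort, using the hypothetical types above. One thing worth noting: a stable sort keeps the NN's topology ordering intact within each group, which softens (though doesn't eliminate) the hammering problem:

{code:java}
import java.util.Comparator;
import java.util.List;

/** Memory-first ordering: cached replicas ahead of disk replicas. */
class MemoryFirstPolicy implements ReplicaOrderingPolicy {
  @Override
  public void order(List<ReplicaLocation> locations) {
    // List.sort is stable, so the topology-based order the NN produced
    // is preserved within the memory group and within the disk group.
    locations.sort(Comparator.comparingInt(
        loc -> loc.type == LocationType.MEMORY ? 0 : 1));
  }
}
{code}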
Note that even without a smarter DFSClient, we can get a lot of benefit just by making schedulers place tasks for memory locality, since our big win is going to be local memory reads. Colin's working on this in
NN will not count RAM replicas towards the normal replica count - this is one area where the RAM replicas are treated differently.
This can support a usage model where the RAM replicas are at all or only some of the disk replica locations.
+1, let's design for a future where a cached replica might not be backed by a disk replica. As Colin notes above, memory HSM is not easy, but the code should be flexible.
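The accounting side of "don't count RAM replicas" is easy to sketch. The enum and helper below are illustrative only; the real storage typing from HDFS-2832 and the NN's block manager data structures may look quite different:

{code:java}
import java.util.List;

// Illustrative only; not the actual HDFS-2832 storage types.
enum StorageKind { DISK, RAM }

class StoredReplica {
  final StorageKind kind;
  StoredReplica(StorageKind kind) { this.kind = kind; }
}

class ReplicationAccounting {
  /** RAM replicas never count toward the configured replication factor. */
  static int durableReplicaCount(List<StoredReplica> replicas) {
    int count = 0;
    for (StoredReplica r : replicas) {
      if (r.kind != StorageKind.RAM) {
        count++;
      }
    }
    return count;
  }

  /** Losing a RAM replica alone should never trigger re-replication. */
  static boolean isUnderReplicated(List<StoredReplica> replicas,
                                   int replicationFactor) {
    return durableReplicaCount(replicas) < replicationFactor;
  }
}
{code}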