[HDFS-9412] getBlocks occupies FSLock and takes too long to complete - ASF JIRA

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 2.8.0, 2.7.4, 3.0.0-alpha1
Component/s: balancer & mover, namenode
Labels: None
Target Version/s: 2.7.4
Hadoop Flags: Reviewed
Release Note:
Skip blocks with size below dfs.balancer.getBlocks.min-block-size (default 10MB) when a balancer asks for a list of blocks.
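
As an illustration only, the filtering described in the release note could be sketched in Java roughly as follows. The class MinBlockSizeFilter, the method selectBlocks, and the sizeBudget parameter are hypothetical names invented for this sketch and are not the NameNode's actual code; the 10MB default is assumed here to mean 10 * 1024 * 1024 bytes.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch: not the actual HDFS implementation.
class MinBlockSizeFilter {
    // Assumed byte value for the 10MB default of
    // dfs.balancer.getBlocks.min-block-size.
    static final long DEFAULT_MIN_BLOCK_SIZE = 10L * 1024 * 1024;

    // Returns the sizes of the blocks that would be handed to the balancer:
    // blocks smaller than minBlockSize are skipped, and collection stops
    // once the accumulated size reaches sizeBudget.
    static List<Long> selectBlocks(List<Long> blockSizes,
                                   long minBlockSize,
                                   long sizeBudget) {
        List<Long> selected = new ArrayList<>();
        long total = 0;
        for (long size : blockSizes) {
            if (size < minBlockSize) {
                continue; // too small to be worth moving; skip it
            }
            selected.add(size);
            total += size;
            if (total >= sizeBudget) {
                break; // enough bytes collected for this request
            }
        }
        return selected;
    }
}
```

Skipping small blocks shortens the returned list and lets the size budget be reached after fewer blocks are collected; whether that shortens the lock hold time in a given cluster depends on its block size distribution.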

Description

getBlocks in NameNodeRpcServer acquires a read lock and may then take a long time to complete (possibly several seconds when the number of blocks is large).
During this period, other threads attempting to acquire the write lock must wait.
In an extreme case, the RPC handlers are all occupied by one reader thread calling getBlocks and by other threads waiting for the write lock, so the RPC server appears to hang. Unfortunately, this tends to happen in heavily loaded clusters: read operations come and go quickly (they do not need to wait), while write operations are left waiting.

It looks like we could optimize this the way the DN block report was optimized in the past: split the operation into smaller sub-operations and let other threads do their work between sub-operations. Unlike the DN block report, though, the whole result is still returned at once.
I am not sure whether this will work. Is there a better idea?
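
To make the idea above concrete, here is a minimal Java sketch of a scan that releases and re-acquires a read lock between batches. It is only an illustration of the proposal, not the change that was committed (the release note describes skipping small blocks instead). The class ChunkedReadScan, the method collect, and the batchSize parameter are hypothetical, and a plain java.util.concurrent ReentrantReadWriteLock stands in for the namesystem lock.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.locks.ReentrantReadWriteLock;

// Hypothetical sketch: a stand-in for a long scan under a read lock.
class ChunkedReadScan {
    // Fair mode, so a writer queued between batches is served before
    // this reader re-acquires the read lock.
    final ReentrantReadWriteLock lock = new ReentrantReadWriteLock(true);
    final List<Long> blocks;

    ChunkedReadScan(List<Long> blocks) {
        this.blocks = blocks;
    }

    // Copies all entries, holding the read lock for at most batchSize
    // entries at a time. The whole result is still returned at once.
    List<Long> collect(int batchSize) {
        if (batchSize <= 0) {
            throw new IllegalArgumentException("batchSize must be positive");
        }
        List<Long> result = new ArrayList<>();
        int next = 0;
        while (true) {
            lock.readLock().lock();
            try {
                int end = Math.min(next + batchSize, blocks.size());
                for (; next < end; next++) {
                    result.add(blocks.get(next));
                }
                if (next >= blocks.size()) {
                    return result; // finished; finally still unlocks
                }
            } finally {
                // Releasing here gives waiting writers a chance to run.
                lock.readLock().unlock();
            }
        }
    }
}
```

One trade-off of this approach is that writers may modify the data between batches, so the returned list is not a consistent snapshot; a real implementation would need to tolerate or detect that.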

Attachments

- HDFS-9412.0000.patch, 08/Dec/15 12:44, 2 kB, He Tianyi
- HDFS-9412.0001.patch, 13/Apr/16 12:55, 2 kB, He Tianyi
- HDFS-9412.0002.patch, 14/Apr/16 03:35, 4 kB, He Tianyi
- HDFS-9412-branch-2.7.00.patch, 22/May/17 23:25, 4 kB, Konstantin Shvachko

Issue Links

is related to: HDFS-8824 Do not use small blocks for balancing the cluster (Resolved)

relates to: HDFS-11855 Backport HDFS-9412 to branch-2.7: getBlocks occupies FSLock and takes too long to complete (Resolved)

People

Assignee: He Tianyi
Reporter: He Tianyi
Votes: 0
Watchers: 17

Dates

Created: 11/Nov/15 11:00
Updated: 11/Apr/18 22:39
Resolved: 18/Apr/16 01:46