[HDFS-14658] Refine NameSystem lock usage during processing FBR - ASF JIRA

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Abandoned
Affects Version/s: None
Fix Version/s: None
Component/s: None
Labels: None

Description

Disks with 12 TB of capacity are common today, which means a full block report (FBR) is much larger than before. BlockManager holds the NameSystem lock while processing the block report for each storage, which can take quite a long time.

In our production environment, processing a large FBR usually causes longer RPC queue times, which impacts client latency. We did some simple work to refine the lock usage, which improved p99 latency significantly.

In our solution, BlockManager releases the NameSystem write lock and requests it again after every 5000 blocks (by default) while processing an FBR. Because the lock is fair, all queued RPC requests can be processed before BlockManager re-acquires the write lock.
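
The batching idea described above could be sketched roughly as follows. This is not the patch itself: the class `ChunkedReportProcessor`, its methods, and the `blocksPerLockHold` parameter are hypothetical names used for illustration, and the sketch assumes the lock behaves like a fair `java.util.concurrent.locks.ReentrantReadWriteLock`.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.locks.ReentrantReadWriteLock;

/** Hypothetical illustration only; this is not the HDFS BlockManager code. */
public class ChunkedReportProcessor {
    // Fair mode: a thread asking for the write lock queues behind threads
    // that are already waiting, so waiters run between our batches.
    private final ReentrantReadWriteLock lock = new ReentrantReadWriteLock(true);
    private final int blocksPerLockHold;            // assumed knob, e.g. 5000
    private final List<Long> processed = new ArrayList<>();
    private int lockAcquisitions = 0;

    public ChunkedReportProcessor(int blocksPerLockHold) {
        if (blocksPerLockHold <= 0) {
            throw new IllegalArgumentException("blocksPerLockHold must be positive");
        }
        this.blocksPerLockHold = blocksPerLockHold;
    }

    /** Processes the report in batches, dropping the write lock between batches. */
    public void processReport(long[] blockIds) {
        int i = 0;
        while (i < blockIds.length) {
            lock.writeLock().lock();
            try {
                lockAcquisitions++;
                int end = Math.min(i + blocksPerLockHold, blockIds.length);
                for (; i < end; i++) {
                    processed.add(blockIds[i]);     // stand-in for per-block work
                }
            } finally {
                // Releasing here gives queued readers and writers a turn
                // before the next batch re-acquires the lock.
                lock.writeLock().unlock();
            }
        }
    }

    public int getLockAcquisitions() { return lockAcquisitions; }

    public int getProcessedCount() { return processed.size(); }
}
```

With a batch size of 5000, a report of 12000 blocks would take the write lock three times (5000, 5000, then 2000 blocks). Because the lock is dropped between batches, any state read in one batch may have changed by the next; the sketch does not address that.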

Issue Links

duplicates

HDFS-14657 Refine NameSystem lock usage during processing FBR

Patch Available

People

Assignee: Unassigned

Reporter: Chen Zhang

Votes: 0

Watchers: 2

Dates

Created: 17/Jul/19 09:21

Updated: 17/Jul/19 16:36

Resolved: 17/Jul/19 10:02