Hadoop HDFS / HDFS-13157

Do Not Remove Blocks Sequentially During Decommission


Details

    • Type: Improvement
    • Status: Patch Available
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 3.0.0
    • Fix Version/s: None
    • Component/s: datanode, namenode
    • Labels: None

    Description

      From what I understand of DataNode decommissioning, it appears that all the blocks are scheduled for removal in order. I'm not 100% sure what the ordering is exactly, but I think it loops through each data volume and schedules each block to be replicated elsewhere. The net effect is that during a decommission, all of the DataNode transfer threads slam on a single volume until it is cleaned out, at which point they all slam on the next volume, and so on.

      Please randomize the block list so that there is a more even distribution across all volumes when decommissioning a node.
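A minimal sketch of the randomization being requested (this is an illustration, not the attached patch; the class and block names are hypothetical). Blocks are listed volume by volume, so iterating the list in order concentrates replication reads on one volume at a time; shuffling the list first spreads the reads across all volumes:

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Random;

public class ShuffleBlocks {
    /**
     * Build a block list the way a per-volume scan would: all blocks of
     * volume 0 first, then volume 1, etc. (Names are illustrative only.)
     */
    public static List<String> buildBlockList(int volumes, int blocksPerVolume) {
        List<String> blocks = new ArrayList<>();
        for (int v = 0; v < volumes; v++) {
            for (int b = 0; b < blocksPerVolume; b++) {
                blocks.add("vol" + v + "-blk" + b);
            }
        }
        return blocks;
    }

    public static void main(String[] args) {
        List<String> blocks = buildBlockList(3, 4);
        // Sequential order: every transfer thread pulls from vol0 until it
        // is drained, then vol1, then vol2.
        System.out.println("sequential: " + blocks);

        // Shuffling before scheduling interleaves the volumes, so the
        // replication work is spread evenly across all spindles.
        Collections.shuffle(blocks, new Random());
        System.out.println("shuffled:   " + blocks);
    }
}
```

The shuffle is a permutation of the same block list, so no replication work is added or lost; only the order the transfer threads see changes.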

      Attachments

        1. HDFS-13157.1.patch
          7 kB
          David Mollitor

            People

              Assignee: belugabehr David Mollitor
              Reporter: belugabehr David Mollitor
              Votes: 1
              Watchers: 18
