[HDFS-11419] BlockPlacementPolicyDefault is choosing datanode in an inefficient way - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Done
Affects Version/s: None
Fix Version/s: None
Component/s: namenode
Labels:
None

Target Version/s:

2.9.0

Description

Currently in BlockPlacementPolicyDefault, chooseTarget will end up calling into chooseRandom, which will first find a random datanode by calling

DatanodeDescriptor chosenNode = chooseDataNode(scope, excludedNodes);

, then it checks whether that returned datanode satisfies storage type requirement

storage = chooseStorage4Block(
              chosenNode, blocksize, results, entry.getKey());

If yes, numOfReplicas--;, otherwise, the node is added to excluded nodes, and runs the loop again until numOfReplicas is down to 0.

A problem here is that, storage type is not being considered only until after a random node is already returned. We've seen a case where a cluster has a large number of datanodes, while only a few satisfy the storage type condition. So, for the most part, this code blindly picks random datanodes that do not satisfy the storage type requirement.

To make matters worse, the way NetworkTopology#chooseRandom works is that, given a set of excluded nodes, it first finds a random datanodes, then if it is in excluded nodes set, try find another random nodes. So the more excluded nodes there are, the more likely a random node will be in the excluded set, in which case we basically wasted one iteration.

Therefore, this JIRA proposes to augment/modify the relevant classes in a way that datanodes can be found more efficiently. There are currently two different high level solutions we are considering:

1. add some field to Node base types to describe the storage type info, and when searching for a node, we take into account such field(s), and do not return node that does not meet the storage type requirement.

2. change NetworkTopology class to be aware of storage types, e.g. for one storage type, there is one tree subset that connects all the nodes with that type. And one search happens on only one such subset. So unexpected storage types are simply not in the search space.

Thanks szetszwo for the offline discussion, and thanks linyiqun for pointing out a wrong statement (corrected now) in the description. Any further comments are more than welcome.

Attachments

Issue Links

is related to

HDFS-9807 Add an optional StorageID to writes

Resolved

Sub-Tasks

1.	Separate class InnerNode from class NetworkTopology and make it extendable	Resolved	Tsz-wo Sze
2.	HDFS specific network topology classes with storage type info included	Resolved	Chen Liang
3.	Add storage type demand to into DFSNetworkTopology#chooseRandom	Resolved	Chen Liang
4.	DFSTopologyNodeImpl#chooseRandom optimizations	Resolved	Chen Liang
5.	Enable DFSNetworkTopology as default	Resolved	Chen Liang
6.	Combine the old and the new chooseRandom for better performance	Resolved	Chen Liang
7.	Use HDFS specific network topology to choose datanode in BlockPlacementPolicyDefault	Resolved	Yiqun Lin
8.	Performance analysis of new DFSNetworkTopology#chooseRandom	Resolved	Chen Liang
9.	Improve the selection in choosing storage for blocks	Patch Available	Yiqun Lin
10.	Stress test of DFSNetworkTopology	Patch Available	Chen Liang

Activity

People

Assignee:: Chen Liang

Reporter:: Chen Liang

Votes:: 0 Vote for this issue

Watchers:: 33 Start watching this issue

Dates

Created:: 15/Feb/17 22:00

Updated:: 24/Apr/18 20:50

Resolved:: 31/Jan/18 20:21