[HDFS-13156] HDFS Block Placement Policy - Client Local Rack - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Minor
Resolution: Fixed
Affects Version/s: 2.9.0, 3.2.0, 3.1.1
Fix Version/s: 3.2.0, 3.3.0
Component/s: documentation
Labels:
None

Hadoop Flags:

Reviewed

Description

For the common case, when the replication factor is three, HDFS’s placement policy is to put one replica on the local machine if the writer is on a datanode, otherwise on a random datanode, another replica on a node in a different (remote) rack, and the last on a different node in the same remote rack.

https://hadoop.apache.org/docs/stable/hadoop-project-dist/hadoop-hdfs/HdfsDesign.html#Replica_Placement:_The_First_Baby_Steps

Having just looked over the Default Block Placement code, the way I understand this, is that, there are three basic scenarios:

HDFS client is running on a datanode inside the cluster
HDFS client is running on a node outside the cluster
HDFS client is running on a non-datanode inside the cluster

The documentation is ambiguous concerning the third scenario. Please correct me if I'm wrong, but the way I understand the code, if there is an HDFS client inside the cluster, but it is not on a datanode, the first block will be placed on a datanode within the set of datanodes available on the local rack and not simply on any random datanode from the set of all datanodes in the cluster.

That is to say, if one rack has an HDFS Sink Flume Agent on a dedicated node, I should expect that every first block will be written to a random datanode on the same rack as the HDFS Flume agent, assuming the network topology script is written to include this Flume node.

If that is correct, can the documentation be updated to include this third common scenario?

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

HDFS-13156-01.patch
11/Sep/18 18:49
3 kB
Ayush Saxena

Activity

People

Assignee:: Ayush Saxena

Reporter:: David Mollitor

Votes:: 0 Vote for this issue

Watchers:: 5 Start watching this issue

Dates

Created:: 16/Feb/18 16:04

Updated:: 12/Oct/18 12:27

Resolved:: 12/Oct/18 12:00