[HDFS-324] Generate a network infrastructre map - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Duplicate
Affects Version/s: None
Fix Version/s: None
Component/s: None
Labels:
None

Description

Assuming an inhomogeneous network it might be sensible to not only collect metrics about to and from the nodes but also

collect latency between nodes
network io between nodes
daytime depending network io between nodes (i.e. network is much slower in the morning time and in the early evening time)
network failures

Assuming the we collect aging information over a period of time, it would allow us to create a "learning cloud" in senses of its infrastructure. From this a network map we can

see subclusters of high speed linked nodes*
see unreliable connections between nodes
see heavily used links over the time

These information can be refed into the DFS (for data distribution as well as for the balancer) logic to increase its reliability and its performance a lot.

*Note : This could already be managed with the rack awareness, but the aging approach would make this much more fine grained and in an automatic manner.

Attachments

Issue Links

is duplicated by

HDFS-289 HDFS should blacklist datanodes that are not performing well

Open

is related to

HADOOP-2830 Need to instrument Hadoop to get comprehensive network traffic metrics

Resolved

Activity

People

Assignee:: Unassigned

Reporter:: Kai Mosebach

Votes:: 0 Vote for this issue

Watchers:: 10 Start watching this issue

Dates

Created:: 05/Sep/08 15:18

Updated:: 19/Jul/14 18:57

Resolved:: 19/Jul/14 18:57