Details
- Type: Improvement
- Status: Closed
- Priority: Major
- Resolution: Duplicate
- Fix Version: 0.2.0
- Component: None
- Labels: None
- Environment: large clusters
Description
When dfs.data.dir has multiple values, we currently start a separate DataNode for each directory (all in the same JVM). Instead we should run a single DataNode that stores block files across the different directories. This would reduce the number of connections to the namenode. We cannot place blocks by hashing, because different devices may be unevenly full. So the datanode will need to keep a table mapping from block id to file location, and add new blocks to the less full devices.
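The proposal amounts to two pieces of state: a block-id-to-directory table and per-volume free-space accounting that steers new blocks to the emptiest device. A minimal sketch of that idea is below; the class and method names are illustrative only and do not reflect Hadoop's actual DataNode API.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch: one DataNode-like store that tracks several storage
// directories, records which directory holds each block, and places new
// blocks on the directory with the most free space (rather than hashing,
// since devices may be unevenly full).
class MultiVolumeStore {
    // Illustrative volume: a directory path plus its remaining capacity.
    static class Volume {
        final String dir;
        long freeBytes;
        Volume(String dir, long freeBytes) {
            this.dir = dir;
            this.freeBytes = freeBytes;
        }
    }

    private final List<Volume> volumes = new ArrayList<>();
    // The table mapping block id -> directory holding that block's file.
    private final Map<Long, String> blockToDir = new HashMap<>();

    void addVolume(String dir, long freeBytes) {
        volumes.add(new Volume(dir, freeBytes));
    }

    // Place a new block on the volume with the most free space.
    String placeBlock(long blockId, long blockSize) {
        Volume best = null;
        for (Volume v : volumes) {
            if (v.freeBytes >= blockSize
                    && (best == null || v.freeBytes > best.freeBytes)) {
                best = v;
            }
        }
        if (best == null) {
            throw new IllegalStateException("no volume has room for block " + blockId);
        }
        best.freeBytes -= blockSize;
        blockToDir.put(blockId, best.dir);
        return best.dir;
    }

    // Look up where an existing block's file lives.
    String locateBlock(long blockId) {
        return blockToDir.get(blockId);
    }
}
```

With two directories of unequal free space, successive blocks land on whichever directory is currently emptiest, and lookups go through the block table rather than any hash of the block id.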
Attachments
Issue Links
- is duplicated by HADOOP-64, "DataNode should be capable of managing multiple volumes" (Closed)