Details
- Improvement
- Status: Closed
- Major
- Resolution: Duplicate
- 0.2.0
- None
- None
Description
If a data node is configured with multiple data directories, the current DFS implementation starts multiple data node threads, each of which manages one data directory and talks to the name node independently. From the name node's point of view, there are multiple data nodes instead of one.
I feel that a more scalable solution would be to start one data node thread that manages multiple data directories. That single data node thread would then need to handle block allocation across its directories.
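To make the block allocation problem concrete, here is a minimal sketch of how a single data node might pick a data directory for each new block. It is not taken from the Hadoop code base: the class `VolumeChooser`, its methods, and the round-robin policy are all assumptions chosen for illustration, and capacities are tracked in memory rather than read from disk.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch: one data node choosing among several data directories. */
public class VolumeChooser {

    /** One data directory with a fixed capacity and a running usage count, in bytes. */
    private static final class Volume {
        final String path;
        final long capacity;
        long used;

        Volume(String path, long capacity) {
            this.path = path;
            this.capacity = capacity;
        }

        long available() {
            return capacity - used;
        }
    }

    private final List<Volume> volumes = new ArrayList<>();
    private int next = 0; // index of the directory to try first on the next allocation

    /** Registers a data directory (hypothetical API; capacity is supplied by the caller). */
    public synchronized void addVolume(String path, long capacity) {
        volumes.add(new Volume(path, capacity));
    }

    /**
     * Round-robin allocation: starting at the directory after the last one used,
     * returns the path of the first directory with enough free space for the block.
     */
    public synchronized String allocate(long blockSize) {
        for (int i = 0; i < volumes.size(); i++) {
            int index = (next + i) % volumes.size();
            Volume v = volumes.get(index);
            if (v.available() >= blockSize) {
                v.used += blockSize;
                next = (index + 1) % volumes.size();
                return v.path;
            }
        }
        throw new IllegalStateException(
                "no data directory has " + blockSize + " bytes free");
    }

    /** Aggregate free space, which is what a single data node would report to the name node. */
    public synchronized long totalAvailable() {
        long total = 0;
        for (Volume v : volumes) {
            total += v.available();
        }
        return total;
    }
}
```

With this shape, the name node would see one data node reporting the combined free space, while the choice of directory stays local to the data node. Round-robin is only one possible policy; picking the directory with the most free space would be an equally plausible alternative.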
Issue Links
- is duplicated by: HADOOP-64 DataNode should be capable of managing multiple volumes (Closed)