Details
-
Bug
-
Status: Resolved
-
Major
-
Resolution: Duplicate
-
None
-
None
-
None
-
None
Description
The current DFS does not detect slow links (nodes).
Thus, when a node or its network link is slow, it may affect the overall system performance significantly.
Specifically, when a map job needs to read data from such a node, it may progress 10X slower.
And when a DFS data node pipeline consists of such a node, the write performance degrades significantly.
This may lead to some long tails for map/reduce jobs. We have experienced such behaviors quite often.