Details
-
Bug
-
Status: Closed
-
Major
-
Resolution: Fixed
-
None
-
None
-
None
Description
one replica of a file should be written locally if possible. That's currently not the case.
Copying a 1GB file using hadoop dfs -cp running on one of the cluster nodes, all the blocks were written to remote nodes, as seen by fsck -files -blocks -locations on the newly created file.
as long as there is sufficient space locally, a local copy has significant performance benefits.