Details
-
Improvement
-
Status: Closed
-
Major
-
Resolution: Fixed
-
None
-
None
-
None
-
Incompatible change, Reviewed
-
Changed GetFileBlockLocations to return topology information for nodes that host the block replicas.
Description
MultiFileInputFormat and FileInputFormat should use block locality information to construct splits.
Attachments
Attachments
Issue Links
- blocks
-
HADOOP-3293 When an input split spans cross block boundary, the split location should be the host having most of bytes on it.
- Closed
-
HADOOP-4565 MultiFileInputSplit can use data locality information to create splits
- Closed