Description
This jira implements the new FileSystem API as proposed in HDFS-202. The new API aims to eliminate individual "getFileBlockLocations" calls to NN for each file in the input directory of a job. Instead, a file's block locations are returned together with FileStatus when listing a directory, thus improving getSplits performance.
Attachments
Attachments
Issue Links
- is related to
-
HDFS-202 Add a bulk FIleSystem.getFileBlockLocations
- Closed
- relates to
-
HADOOP-6900 FileSystem#listLocatedStatus should not throw generic RuntimeException to indicate error conditions
- Closed
-
HADOOP-6890 Improve listFiles API introduced by HADOOP-6870
- Closed