Details
-
Improvement
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
-
None
-
None
Description
FileInputFormat::getSplits uses FileSystem::globStatus to determine its inputs. When the glob returns directories, each is traversed and LocatedFileStatus instances are returned with the block locations. However, when the glob returns files, this is a FileStatus that requires a second RPC to obtain its locations.