Details
-
Improvement
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
-
None
Description
Hive uses file system metadata to estimate number of reducers and to determine if jobs can be executed locally. This currently looks up file system metadata about each path serially and can take a long time in case number of files is very high.
instead we can lookup part of the input space and try to approximate the size etc. summaries
Attachments
Issue Links
- relates to
-
HIVE-17004 Calculating Number Of Reducers Looks At All Files
- Open