Details
-
Sub-task
-
Status: Open
-
Major
-
Resolution: Unresolved
-
0.3.0
-
None
-
None
-
None
Description
Branch: hive-14535
When running "insert overwrite...on partitioned table" with 2000+ partitions, good amount of time (~245 seconds) was spent in iterating every mmDirectory entry and checking its file listings in S3. Creating this jira to consider optimizing this codepath, as information from getMmDirectoryCandidates could be used in terms of reducing the number of times S3 needs to be contacted.