Details
Type: New Feature
Status: Closed
Priority: Major
Resolution: Unresolved
None
Description
Hive incremental queries on Hoodie currently have a limitation: when a datestr is not present, they list all partitions (.hoodie and the partitions) and then throw away many of the listed files, because the `_hoodie_commit_time` column values filter those files out. This can be very expensive. It increases query planning time and sometimes causes timeouts if the table is large. The original issue is tracked here: https://github.com/uber/hudi/issues/492
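To make the cost concrete, here is a minimal Python sketch of the pattern described above. It is a toy model, not Hudi or Hive code: the `layout` dictionary, the file names, and the function `plan_by_full_listing` are all hypothetical, and it assumes commit times are fixed-width timestamp strings that compare correctly as text.

```python
from typing import Dict, List, Tuple

# Hypothetical in-memory stand-in for a partitioned table:
# partition path -> list of (file name, commit time that wrote the file).
TableLayout = Dict[str, List[Tuple[str, str]]]


def plan_by_full_listing(layout: TableLayout, start_commit: str):
    """List every partition, then keep only files written after start_commit."""
    listed = 0
    kept = []
    for partition, files in layout.items():
        for name, commit_time in files:
            listed += 1  # every file is listed, whether or not it is needed
            if commit_time > start_commit:
                kept.append(f"{partition}/{name}")
    return listed, kept


# Made-up example data: three partitions, four files.
layout = {
    "2019/01/01": [("a.parquet", "20190101120000"), ("b.parquet", "20190101130000")],
    "2019/01/02": [("c.parquet", "20190102120000")],
    "2019/01/03": [("d.parquet", "20190103120000")],
}

listed, kept = plan_by_full_listing(layout, start_commit="20190102180000")
print(f"listed {listed} files, kept {len(kept)}: {kept}")
```

In this toy layout the planner touches every partition but keeps only the files from the most recent commit. On a large table the listing work grows with the total number of partitions, while the useful result grows only with the number of recently changed files.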