Details
-
Bug
-
Status: Closed
-
Major
-
Resolution: Fixed
-
None
-
None
Description
Data skipping (pruning files) for RLI is supported only when the query predicate has `EqualTo` or `In` expressions/filters on the record-key column. However, the logic for detecting valid `In` expression/filter on record-key has bugs. It tries to prune files assuming that `In` expression/filter can reference only record-key column even when the `In` query is based on other columns.
For example, a query of the foem `select * from trips_table where driver in ('abc', 'xyz')` has the potential to return wrong results if the record-key for this table also has values 'abc' or 'xyz' for some rows of the table.
Attachments
Issue Links
- is depended upon by
-
HUDI-7007 Integrate functional index using bloom filter on reader side
- Reopened
- links to