Uploaded image for project: 'Apache Hudi'
  1. Apache Hudi
  2. HUDI-7569

Fix wrong result while using RLI for pruning files

    XMLWordPrintableJSON

Details

    Description

      Data skipping (pruning files) for RLI is supported only when the query predicate has `EqualTo` or `In` expressions/filters on the record-key column. However, the logic for detecting valid `In` expression/filter on record-key has bugs. It tries to prune files assuming that `In` expression/filter can reference only record-key column even when the `In` query is based on other columns.

       

      For example, a query of the foem `select * from trips_table where driver in ('abc', 'xyz')` has the potential to return wrong results if the record-key for this table also has values 'abc' or 'xyz' for some rows of the table.

      Attachments

        Issue Links

          Activity

            People

              vinay.bhat Vinaykumar Bhat
              vinay.bhat Vinaykumar Bhat
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: