Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-12087 Performance improvements on Iceberg table queries
  3. IMPALA-12089

Be able to skip pushing down a subset of the predicates

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Done
    • None
    • None
    • Frontend
    • ghx-label-3

    Description

      https://issues.apache.org/jira/browse/IMPALA-11701 introduced logic to skip pushing down predicates to Impala scanners if they are already applied by Iceberg and won't filter any further rows. This is an "all or nothing" approach where we either skip pushing down all the predicates or we push down all of them.

      As a more sophisticated approach we should be able to push down a subset of the predicates to Impala Scan nodes. For this we should be able to map Iceberg predicates (returned from residual()) to Impala predicates. This might not be that trivial as Iceberg sometimes doesn't return the exact same predicates as it received through planFiles(). E.g. the object ID might be different making the mapping more difficult.

      Attachments

        Issue Links

          Activity

            People

              prozsa Peter Rozsa
              gaborkaszab Gabor Kaszab
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: