IMPALA-2328

Parquet scan should use min/max statistics to skip blocks based on predicate

    Details

      Description

      Parquet stores min/max statistics that can be used to skip reading blocks that cannot contain any row satisfying a given predicate.

      The query below ends up scanning all rows, which is unnecessary: a block whose min/max range for l_orderkey excludes -1 cannot contain a matching row.

      select count(*) from tpch_parquet.lineitem where l_orderkey = -1;
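
      The skipping check requested here can be sketched roughly as follows. This is a minimal illustration in Python, not Impala's implementation: the BlockStats type, the function names, and the statistics values are all hypothetical, and it assumes every block has valid min/max statistics for the column and ignores NULL handling.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class BlockStats:
    """Hypothetical per-block min/max statistics for a single column."""
    min_value: int
    max_value: int


def can_skip_for_equality(stats: BlockStats, literal: int) -> bool:
    # For a predicate `col = literal`, no row in the block can match
    # when the literal lies outside the block's [min, max] range.
    return literal < stats.min_value or literal > stats.max_value


def blocks_to_scan(blocks: List[BlockStats], literal: int) -> List[int]:
    # Return the indexes of the blocks that still have to be read.
    return [i for i, s in enumerate(blocks)
            if not can_skip_for_equality(s, literal)]


# Made-up statistics for an l_orderkey-like column (illustration only).
blocks = [BlockStats(1, 1000), BlockStats(1001, 2000), BlockStats(2001, 3000)]

print(blocks_to_scan(blocks, -1))    # no block can contain -1
print(blocks_to_scan(blocks, 1500))  # only the second block may match
```

      With statistics like these, a predicate such as l_orderkey = -1 would let the scan skip every block, and the count could be answered without reading any row data.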
      


              People

              • Assignee: Lars Volker (lv)
              • Reporter: Mostafa Mokhtar (mmokhtar)
              • Votes: 3
              • Watchers: 14
