Uploaded image for project: 'Apache Drill'
  1. Apache Drill
  2. DRILL-5796

Filter pruning for multi rowgroup parquet file

    XMLWordPrintableJSON

Details

    Description

      Today, filter pruning use the file name as the partitioning key. This means you can remove a partition only if the whole file is for the same partition. With parquet, you can prune the filter if the rowgroup make a partition of your dataset as the unit of work if the rowgroup not the file.

      Attachments

        Issue Links

          Activity

            People

              jimbert Jean-Blas IMBERT
              dprofeta Damien Profeta
              Vlad Rozov Vlad Rozov
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: