Apache Drill / DRILL-5796

Filter pruning for multi rowgroup parquet file


Details

    Description

      Today, filter pruning uses the file name as the partitioning key. This means a partition can be removed only if the whole file belongs to the same partition. With Parquet, pruning can also be applied when each row group forms a partition of the dataset: the unit of work is then the row group, not the file.
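
      To illustrate the difference, here is a minimal sketch in Python. It is not Drill's implementation: `RowGroup`, `prune_by_file`, and `prune_by_row_group` are hypothetical names, and the sketch assumes that each row group exposes min/max statistics per column and that a row group counts as a partition for a column when its min equals its max.

```python
from dataclasses import dataclass
from typing import Any, Dict, List, Tuple


@dataclass
class RowGroup:
    """Hypothetical stand-in for Parquet row-group metadata."""
    file: str
    index: int
    stats: Dict[str, Tuple[Any, Any]]  # column name -> (min, max)


def single_value(rg: RowGroup, column: str) -> Tuple[bool, Any]:
    """Return (True, value) if the row group holds one value for the column."""
    bounds = rg.stats.get(column)
    if bounds is None or bounds[0] != bounds[1]:
        return False, None
    return True, bounds[0]


def prune_by_file(row_groups: List[RowGroup], column: str, wanted: Any) -> List[RowGroup]:
    """File-level pruning: drop a file only if ALL its row groups share one
    value for the column and that value does not match the filter."""
    by_file: Dict[str, List[RowGroup]] = {}
    for rg in row_groups:
        by_file.setdefault(rg.file, []).append(rg)

    kept: List[RowGroup] = []
    for groups in by_file.values():
        checks = [single_value(rg, column) for rg in groups]
        values = {value for ok, value in checks if ok}
        whole_file_is_partition = all(ok for ok, _ in checks) and len(values) == 1
        if whole_file_is_partition and wanted not in values:
            continue  # the entire file can be skipped
        kept.extend(groups)
    return kept


def prune_by_row_group(row_groups: List[RowGroup], column: str, wanted: Any) -> List[RowGroup]:
    """Row-group-level pruning: drop each row group independently."""
    kept: List[RowGroup] = []
    for rg in row_groups:
        ok, value = single_value(rg, column)
        if ok and value != wanted:
            continue  # this row group cannot contain matching rows
        kept.append(rg)
    return kept


# Made-up example data: a.parquet mixes two years across its row groups.
row_groups = [
    RowGroup("a.parquet", 0, {"year": (2016, 2016)}),
    RowGroup("a.parquet", 1, {"year": (2017, 2017)}),
    RowGroup("b.parquet", 0, {"year": (2017, 2017)}),
    RowGroup("c.parquet", 0, {"year": (2016, 2016)}),
]

file_level = prune_by_file(row_groups, "year", 2017)
row_group_level = prune_by_row_group(row_groups, "year", 2017)
```

      With the filter `year = 2017`, file-level pruning can only drop `c.parquet` and must still read both row groups of `a.parquet`, while row-group-level pruning also skips row group 0 of `a.parquet`.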


          People

            jimbert Jean-Blas IMBERT
            dprofeta Damien Profeta
            Vlad Rozov Vlad Rozov
            Votes: 0
            Watchers: 4

            Dates

              Created:
              Updated:
              Resolved:
