IMPALA-10709

Min/max filters should be enabled for joins on sorted columns in Parquet tables


Details

    • Type: Test
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Fix Version/s: Impala 4.1.0

    Description

      Currently, the min/max filter feature is turned off by default (MINMAX_FILTER_THRESHOLD=0).

      When joining on sorted columns of a Parquet fact table created by Impala, the feature can be turned on by default. This is because Impala sorts the data on the SORT BY columns within each data file when the table is populated. A min/max filter can then cheaply reject pages whose value range does not overlap the range specified in the filter.
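
      The page-rejection idea can be illustrated with a minimal sketch. This is not Impala's implementation: the names Page, split_into_pages and pages_surviving_filter are hypothetical, the data is synthetic, and the page size is chosen only for illustration. It assumes each page carries min/max statistics and that the filter is a closed range built from the join keys on the build side.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Page:
    """Hypothetical stand-in for a Parquet data page and its min/max statistics."""
    values: List[int]

    @property
    def min_value(self) -> int:
        return min(self.values)

    @property
    def max_value(self) -> int:
        return max(self.values)


def split_into_pages(column: List[int], page_size: int) -> List[Page]:
    """Cut a column into consecutive fixed-size pages (illustrative only)."""
    return [Page(column[i:i + page_size]) for i in range(0, len(column), page_size)]


def pages_surviving_filter(pages: List[Page], filter_min: int, filter_max: int) -> List[Page]:
    """Keep only pages whose [min, max] range overlaps [filter_min, filter_max]."""
    return [p for p in pages if p.max_value >= filter_min and p.min_value <= filter_max]


# Assumed build-side join keys; the min/max filter is their value range.
join_keys = [42, 45, 47]
filter_min, filter_max = min(join_keys), max(join_keys)

# Sorted column: each page covers a narrow, disjoint value range.
sorted_pages = split_into_pages(list(range(100)), page_size=10)
kept_sorted = pages_surviving_filter(sorted_pages, filter_min, filter_max)

# Same values in a scattered order: every page spans a wide value range.
scattered = [(i * 37) % 100 for i in range(100)]
scattered_pages = split_into_pages(scattered, page_size=10)
kept_scattered = pages_surviving_filter(scattered_pages, filter_min, filter_max)

print(f"sorted column:    {len(kept_sorted)} of {len(sorted_pages)} pages read")
print(f"scattered column: {len(kept_scattered)} of {len(scattered_pages)} pages read")
```

      In this sketch the sorted column lets the filter skip most pages, while the scattered column gives each page a wide min/max range, so the filter rejects little. That difference is the motivation for enabling the filter by default only for joins on sorted columns.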


          People

            sql_forever Qifan Chen
