Kudu / KUDU-3033

Add min/max values for the non-primary key columns in the metadata of rowsets/datablocks


Details

    Description

      It's possible to add min/max values for the non-primary key columns to the metadata of each diskrowset/datablock, so that a scan can skip decoding and evaluating any diskrowset/datablock whose value range cannot satisfy the scan's predicates. This is similar to the "compute stats" feature in Impala; the only difference is that Kudu supports updates. So the min/max values of a column must be treated as invalid during a scan if that column has deltas.
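      The skipping rule described above can be sketched as follows. This is only an illustration in Python, not Kudu code: the names `ColumnStats`, `has_deltas`, and `can_skip` are hypothetical, the column is assumed to hold integers, and the predicate is assumed to be an inclusive range.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ColumnStats:
    """Hypothetical min/max metadata for one non-primary-key column of a rowset."""
    min_value: int
    max_value: int
    has_deltas: bool = False  # True if the column has updates not reflected in min/max


def can_skip(stats: Optional[ColumnStats], lower: int, upper: int) -> bool:
    """Return True if no row in the rowset can satisfy lower <= col <= upper."""
    # Without stats, or with stats invalidated by deltas, the rowset must be scanned.
    if stats is None or stats.has_deltas:
        return False
    # The rowset can be skipped only if its value range misses the predicate range.
    return stats.max_value < lower or stats.min_value > upper
```

      With stats of [10, 20], a predicate of 30 <= col <= 40 lets the scan skip the rowset, but the same stats with deltas present do not, because an update may have moved a value into the predicate range.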


          People

            Assignee: Unassigned
            Reporter: LiFu He (helifu)
