Uploaded image for project: 'Kudu'
  1. Kudu
  2. KUDU-1639 Improve predicate pushdown
  3. KUDU-1291

Efficiently support predicates on non-prefix key components

Attach filesAttach ScreenshotAdd voteVotersWatch issueWatchersLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    Description

      In a lot of workloads, users have a compound primary key where the first component (or few components) is low cardinality. For example, a time series workload may have (year, month, day, entity_id, timestamp) as a primary key. A metrics or log storage workload might have (hostname, timestamp).

      It's common to want to do cross-user or cross-date analytics like 'WHERE timestamp BETWEEN <a> and <b>' without specifying any predicate for the first column(s) of the PK. Currently, we do not execute this efficiently, but rather scan the whole table evaluating the predicate.

      Attachments

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            Unassigned Unassigned
            tlipcon Todd Lipcon

            Dates

              Created:
              Updated:

              Slack

                Issue deployment