Kudu / KUDU-1639 Improve predicate pushdown / KUDU-1291

Efficiently support predicates on non-prefix key components


Details

    Description

      In many workloads, users have a compound primary key whose first component (or first few components) has low cardinality. For example, a time series workload may have (year, month, day, entity_id, timestamp) as a primary key. A metrics or log storage workload might have (hostname, timestamp).

      It's common to want to run cross-user or cross-date analytics such as 'WHERE timestamp BETWEEN <a> AND <b>' without specifying any predicate on the leading column(s) of the primary key. Currently, we do not execute such queries efficiently: because the predicate does not constrain a prefix of the key, we scan the whole table and evaluate the predicate on every row.
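
To make the cost difference concrete, here is a minimal Python sketch over an in-memory list sorted by (hostname, timestamp). It is not Kudu code and does not describe how the issue is or will be resolved. The function names `full_scan` and `skip_scan` and the sample data are hypothetical, and the "skip scan" approach (seek once per distinct prefix value, then read only the matching range) is shown as one possible technique, on the assumption that the leading column has few distinct values.

```python
import bisect

def full_scan(rows, lo, hi):
    """Baseline: visit every row and evaluate the timestamp predicate."""
    return [(host, ts) for host, ts in rows if lo <= ts <= hi]

def skip_scan(rows, lo, hi):
    """Illustrative skip scan over rows sorted by (host, ts).

    For each distinct host, seek to the first row with ts >= lo, read
    until ts > hi, then seek directly to the first row of the next host.
    """
    out = []
    i, n = 0, len(rows)
    while i < n:
        host = rows[i][0]
        # Seek to the first row of this host whose timestamp is >= lo.
        j = bisect.bisect_left(rows, (host, lo), i)
        # Read only the rows of this host that satisfy the predicate.
        while j < n and rows[j][0] == host and rows[j][1] <= hi:
            out.append(rows[j])
            j += 1
        # Seek past the remaining rows of this host to the next prefix.
        i = bisect.bisect_right(rows, (host, float("inf")), j)
    return out

# Hypothetical sample data: 3 hosts, 10 timestamps each, sorted by key.
rows = sorted((h, t) for h in ("a", "b", "c") for t in range(0, 100, 10))
matches = skip_scan(rows, 25, 55)
```

Both functions return the same rows. The skip scan performs roughly two seeks per distinct hostname plus one read per matching row, so it only pays off when the leading key component has low cardinality, which is the situation described above.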

          People

            Assignee: Unassigned
            Reporter: Todd Lipcon (tlipcon)