[CALCITE-732] Implement multiple distinct-COUNT using GROUPING SETS - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 1.4.0-incubating
Component/s: None
Labels:
None

Description

Currently if a query has COUNT(DISTINCT x) and COUNT(DISTINCT y) we compute the distinct counts separately and combine them using a join. The join isn't too expensive (because usually the GROUP BY has only a few keys) but we make multiple scans over the base table.

I think we could translate multiple distinct-counts into a GROUPING SETS query (i.e. an Aggregate with more than one element in the groupSets field). If the underlying engine can evaluate that efficiently, then we have saved ourselves a join and several scans.

Attachments

Issue Links

relates to

CALCITE-6332 Optimization CoreRules.AGGREGATE_EXPAND_DISTINCT_AGGREGATES_TO_JOIN produces incorrect results for aggregates with groupSets

Closed

Activity

People

Assignee:: Julian Hyde

Reporter:: Julian Hyde

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Dates

Created:: 15/May/15 20:13

Updated:: 17/Mar/24 00:53

Resolved:: 01/Jun/15 18:32