Details
-
Bug
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
-
None
-
None
Description
Right now count distinct on Druid is translated as a 'cardinality' aggregator which uses hyperloglog and return approximate results. See cardinality aggregator here - http://druid.io/docs/latest/querying/aggregations.html for details.
case COUNT: if (aggCall.isDistinct()) { return new JsonCardinalityAggregation("cardinality", name, list); } return new JsonAggregation("count", name, only);
The current recommended way in druid to get exact counts is to do a nested groupby query.
Attachments
Issue Links
- is related to
-
CALCITE-1588 Add SQL syntax to allow approximate LIMIT and distinct-COUNT
- Open
-
CALCITE-1587 Druid adapter: topN returns approximate results
- Closed
- relates to
-
CALCITE-1787 thetaSketch Support for Druid Adapter
- Closed