Details
-
Bug
-
Status: Closed
-
Major
-
Resolution: Resolved
-
1.29.0
Description
Summary: Some engines(Hive,Spark,BigQuery,Oracle,Snowflake) support APPROX_COUNT_DISTINCT function, while others do not. So we can use the parameter SqlDialect#supportsApproxCountDistinct to control whether to use APPROX_COUNT_DISTINCT(It is the same as APPROX_DISTINCT for Presto).
Problem: Before fix for all SqlDialects
SELECT APPROX_COUNT_DISTINCT(product_id) FROM foodmart.product
will be
SELECT COUNT(DISTINCT product_id) FROM foodmart.product
This can cause many tasks to run too slowly.
Attachments
Issue Links
- links to