Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
1.6.0
Description
Use try to match the behavior for single distinct aggregation with Spark 1.5, but that's not scalable, we should be robust by default, have a flag to address performance regression for low cardinality aggregation.
Attachments
Issue Links
- is duplicated by
-
SPARK-6006 Optimize count distinct in case of high cardinality columns
- Closed
- links to