Details
- Type: New Feature
- Status: Resolved
- Priority: Critical
- Resolution: Fixed
None
Description
ARROW-13620 added a binding for n_distinct() but it only works for grouped aggregation, not whole-table aggregation.
This works:
Table$create(starwars) %>% group_by(homeworld) %>% summarise(n_distinct(species)) %>% collect()
but this errors:
Table$create(starwars) %>% summarise(n_distinct(species)) %>% collect()
#> Error: Key error: No function registered with name: count_distinct
Once we have a non-hash count_distinct aggregate kernel in the C++ library (ARROW-14035) we should bind the options for it in the R package and add a test.
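A minimal sketch of what such a test could check, assuming an arrow build that already includes the non-hash count_distinct kernel from ARROW-14035. The data frame `df` and the result names `expected` and `actual` are made up for illustration and are not taken from the package's test suite. Results are compared against dplyr's own output rather than hard-coded values.

```r
library(arrow, warn.conflicts = FALSE)
library(dplyr, warn.conflicts = FALSE)

# Hypothetical input with a repeated value and a missing value
df <- data.frame(
  species = c("Human", "Droid", "Human", NA, "Wookiee"),
  stringsAsFactors = FALSE
)

# Reference result computed by dplyr on the plain data frame
expected <- df %>%
  summarise(
    n_all = n_distinct(species),
    n_valid = n_distinct(species, na.rm = TRUE)
  )

# Same whole-table (ungrouped) aggregation evaluated by Arrow
actual <- Table$create(df) %>%
  summarise(
    n_all = n_distinct(species),
    n_valid = n_distinct(species, na.rm = TRUE)
  ) %>%
  collect()

# Both engines should agree, with and without missing values
stopifnot(
  as.integer(actual$n_all) == expected$n_all,
  as.integer(actual$n_valid) == expected$n_valid
)
```

On a build without the kernel, the Arrow pipeline is expected to fail with the "No function registered with name: count_distinct" error reported above.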
Issue Links
- depends upon: ARROW-14035 [C++][Compute] Implement non-hash count_distinct aggregate kernel (Resolved)
- relates to: ARROW-13620 [R] Binding for n_distinct() (Resolved)