Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-14036

[R] Binding for n_distinct() with no grouping

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Resolved
    • Critical
    • Resolution: Fixed
    • None
    • 6.0.0
    • R

    Description

      ARROW-13620 added a binding for n_distinct() but it only works for grouped aggregation, not whole-table aggregation. 

      This works:

      Table$create(starwars) %>%
        group_by(homeworld) %>%
        summarise(n_distinct(species)) %>%
        collect()

      but this errors:

      Table$create(starwars) %>%
        summarise(n_distinct(species)) %>%
        collect()
      
      #> Error: Key error: No function registered with name: count_distinct

      Once we have a non-hash count_distinct aggregate kernel in the C++ library (ARROW-14035) we should bind the options for it in the R package and add a test.

      Attachments

        Issue Links

          Activity

            People

              aucahuasi Percy Camilo Triveño Aucahuasi
              icook Ian Cook
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: