Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-13737

[C++] Support scalar columns in hash aggregations (was: hash_sum on scalar column segfaults)

    XMLWordPrintableJSON

Details

    Description

      Reproducer in R. Works fine without grouping, but segfaults if you group:

      tab <- Table$create(a=1:4, b=6:9)
      
      tab %>% 
        mutate(counter = 1) %>% 
        summarize(count = sum(counter))
      
      Table
      1 rows x 1 columns
      $count <double>
      
      tab %>% 
        mutate(counter = 1) %>% 
        group_by(a) %>% 
        summarize(count = sum(counter))
      
       *** caught segfault ***
      address 0x0, cause 'invalid permissions'
      
      Traceback:
       1: .Call(`_arrow_ExecPlan_run`, plan, final_node)
       2: ExecPlan_run(self, node)
       3: plan$Run(final_node)
       4: do_exec_plan(.data, group_vars = dplyr::group_vars(.data))
       5: do_arrow_summarize(.data, ...)
      ...
      

      Attachments

        Issue Links

          Activity

            People

              lidavidm David Li
              npr Neal Richardson
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 2h 20m
                  2h 20m