Details
-
Bug
-
Status: Resolved
-
Blocker
-
Resolution: Fixed
-
None
Description
If one puts a field that is one of the partitioning variables in group_by() and then summarises, we get a segfault:
library(arrow) library(dplyr) temp <- tempfile() write_dataset(mtcars, path = temp, partitioning = "cyl") ds <- open_dataset(temp) # this works just fine ds %>% group_by(gear) %>% summarise( sum(mpg) ) %>% collect() # however this segfaults (regardless of the aggregation, even simply n()) # *** caught segfault *** # address 0x0, cause 'memory not mapped' ds %>% group_by(cyl) %>% summarise( sum(mpg) ) %>% collect()
Attachments
Issue Links
- links to