Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-14321

[R] segfault converting dictionary ChunkedArray with 0 chunks

    XMLWordPrintableJSON

Details

    Description

      It appears to happen when one of the filter parts has no matching rows:

      library(arrow)
      library(dplyr)
      first_date <- lubridate::ymd_hms("2015-04-29 03:12:39")
      
      df1 <- tibble::tibble(
        int = 1:10,
        dbl = as.numeric(1:10),
        lgl = rep(c(TRUE, FALSE, NA, TRUE, FALSE), 2),
        chr = letters[1:10],
        fct = factor(LETTERS[1:10]),
        ts = first_date + lubridate::days(1:10)
      )
      
      ds <- InMemoryDataset$create(df1)
      
      # works
      ds %>% 
        filter(int < 8) %>%
        arrange(dbl) %>%
        collect()
      
      # segfaults
      ds %>% 
        filter(int < 8, int > 55) %>%
        arrange(dbl) %>%
        collect()
      
       segfaults
      ds %>% 
        filter(int < 0) %>%
        arrange(dbl) %>%
        collect()
      

      Attachments

        Issue Links

          Activity

            People

              westonpace Weston Pace
              jonkeane Jonathan Keane
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 40m
                  40m