Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-8748

[R] Add bindings to ConcatenateTables

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • R
    • None

    Description

      First at all, many thanks for your hard work! I was quite exited, when you guys implemented some basic function of the the dplyr package. Is there a why to combine tow or more arrow tables into one by rows or columns? At the moment my workaround looks like this:

      dplyr::bind_rows(
         "a" = arrow.table.1 %>% dplyr::collect(),
         "b" = arrow.table.2 %>% dplyr::collect(),
         "c" = arrow.table.3 %>% dplyr::collect(),
         "d" = arrow.table.4 %>% dplyr::collect(),
         .id = "ID"
       ) %>% 
       arrow::write_ipc_stream(sink = "file_name_combined_tables.arrow")
      

      But this is actually not really a meaningful measure because of putting the data back as dataframes/tibbles into the r environment, which might lead to an exhaust of RAM space. Perhaps you might have a better workaround on hand. It would be great if you guys could implement the bind_rows and bind_cols methods provided by dplyr.

      dplyr::bind_rows(
         "a" = arrow.table.1,
         "b" = arrow.table.2,
         "c" = arrow.table.3,
         "d" = arrow.table.4, 
         .id = "ID"
      ) %>% 
       arrow::write_ipc_stream(sink = "file_name_combined_tables.arrow")

       

       

      Attachments

        Activity

          People

            Unassigned Unassigned
            domiden Dominic Dennenmoser
            Votes:
            2 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:

              Time Tracking

                Estimated:
                Original Estimate - Not Specified
                Not Specified
                Remaining:
                Remaining Estimate - 0h
                0h
                Logged:
                Time Spent - 20m
                20m