Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-15040

[R] Enable write_csv_arrow to take Dataset or arrow_dplyr_query as input

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 8.0.0
    • R

    Description

      Currently, this code fails:

      dataset <- open_dataset("some/folder/with/parquet/files")
      write_csv_arrow(dataset, sink = "dataset.csv")
      

      with this error message:

      Error: x must be an object of class 'data.frame', 'RecordBatch', or 'Table', not 'FileSystemDataset'.
      

      In ARROW-14741, support was added for reading from a RecordBatchReader, so we should be able to now extend write_csv_arrow() to allow this behaviour.

       

      Note: We would need to make sure whatever write_csv(record_batch_reader) function can take a filesystem= argument

      Attachments

        Issue Links

          Activity

            People

              thisisnic Nicola Crane
              thisisnic Nicola Crane
              Votes:
              1 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 5h
                  5h