[ARROW-15040] [R] Enable write_csv_arrow to take Dataset or arrow_dplyr_query as input - ASF JIRA

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 8.0.0
Component/s: R
Labels:
- pull-request-available

External issue URL:
https://github.com/apache/arrow/issues/30557

Description

Currently, this code fails:

library(arrow)

# Open a multi-file Parquet dataset, then try to write it out as a single CSV
dataset <- open_dataset("some/folder/with/parquet/files")
write_csv_arrow(dataset, sink = "dataset.csv")

with this error message:

Error: x must be an object of class 'data.frame', 'RecordBatch', or 'Table', not 'FileSystemDataset'.

In ARROW-14741, the C++ CSV writer gained support for taking a RecordBatchReader as input, so we should now be able to extend write_csv_arrow() to accept a Dataset or arrow_dplyr_query.
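
The requested behaviour can be sketched as follows. This is a minimal illustration, not the code from the pull request: it assumes arrow 8.0.0 or later (the Fix Version listed above) with dplyr installed, and the temporary paths and the use of the built-in mtcars data are stand-ins chosen only to make the example self-contained.

```r
library(arrow)
library(dplyr)

# Build a small Parquet dataset in a temporary folder (hypothetical stand-in
# for "some/folder/with/parquet/files")
ds_dir <- tempfile("parquet_ds_")
write_dataset(mtcars, ds_dir, format = "parquet")

dataset <- open_dataset(ds_dir)

# Case 1: pass the Dataset itself
csv_all <- tempfile(fileext = ".csv")
write_csv_arrow(dataset, sink = csv_all)

# Case 2: pass an unevaluated dplyr query (arrow_dplyr_query) on the Dataset
csv_filtered <- tempfile(fileext = ".csv")
dataset %>%
  filter(cyl == 4) %>%
  select(mpg, cyl) %>%
  write_csv_arrow(sink = csv_filtered)

# Read both files back with base R to inspect the result
all_rows <- read.csv(csv_all)
four_cyl <- read.csv(csv_filtered)
```

In both cases the data is written without first calling collect(), which is the point of accepting these input types.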

Note: We need to make sure that whichever function implements write_csv(record_batch_reader) can also take a filesystem= argument.
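
One way to target a specific filesystem with the existing interface is to pass an OutputStream opened on that filesystem as the sink. The sketch below is an assumption about usage, not the filesystem= argument discussed in the note: it assumes arrow 8.0.0 or later, uses LocalFileSystem purely as a stand-in for a remote filesystem such as S3, and all paths are hypothetical temporary files.

```r
library(arrow)

# Small Parquet dataset to stream from (stand-in data)
ds_dir <- tempfile("parquet_ds_")
write_dataset(mtcars, ds_dir, format = "parquet")
dataset <- open_dataset(ds_dir)

# LocalFileSystem stands in for any FileSystem object (assumption)
fs <- LocalFileSystem$create()
out_path <- tempfile(fileext = ".csv")

# Open an output stream on the chosen filesystem and use it as the sink
out <- fs$OpenOutputStream(out_path)
write_csv_arrow(dataset, sink = out)
# A stream opened by the caller should also be closed by the caller
out$close()

result <- read.csv(out_path)
```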


Issue Links

depends upon

ARROW-14741 [C++] Allow CSV Writer to take a RecordBatchReader as input

Resolved

is blocked by

ARROW-15128 [C++] segfault when writing CSV from RecordBatchReader

Closed

is duplicated by

ARROW-15104 write_parquet() / write_csv_arrow() cannot stream a dataset object back to S3

Closed

relates to

ARROW-15271 [R] Refactor do_exec_plan to return a RecordBatchReader

Resolved

links to

GitHub Pull Request #11971

Activity


Nicola Crane added a comment - 01/Mar/22 18:08

Issue resolved by pull request 11971
https://github.com/apache/arrow/pull/11971


Rok Mihevc added a comment - 11/Jan/23 08:44

This issue has been migrated to issue #30557 on GitHub. Please see the migration documentation for further details.


People

Assignee: Nicola Crane

Reporter: Nicola Crane

Votes: 1

Watchers: 3

Dates

Created: 09/Dec/21 08:10

Updated: 11/Jan/23 08:44

Resolved: 01/Mar/22 18:08

Time Tracking

Estimated:

Not Specified

Remaining:

Logged:

Apache Arrow