Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-15104

write_parquet() / write_csv_arrow() cannot stream a dataset object back to S3

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Minor
    • Resolution: Duplicate
    • 6.0.1
    • None
    • R
    • None

    Description

      It seems we cannot send a dataset (table) object opened with `open_dataset()` into `write_parquet()` or `write_csv_arrow()`.  I realize that the former is a 'multi-file' representation and the latter two are not, but as both represent a single table (single schema) it seems like this should still work.  (It also looks like write_csv_arrow() gives us more control over formatting details than write_dataset() )

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              cboettig Carl Boettiger
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: