Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-9854

[R] Support reading/writing data to/from S3

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 2.0.0
    • R

    Description

      Current S3 support is limited to (1) being able to instantiate an S3FileSystem object, primarily from a URI, and (2) ability to open_dataset from an S3 URI. Before widely declaring that we support S3 in R, we should be able to:

      • download dataset (i.e. copy files/directory recursively)
      • read_parquet/feather/etc. from S3 (use FileSystem->OpenInputFile(path))
      • write_$FORMAT via FileSystem->OpenOutputStream(path)
      • write_dataset
      • for linux, an argument to install_arrow to help, assuming you've installed aws-sdk-cpp already (turn on ARROW_S3, AWSSDK_SOURCE=SYSTEM)
      • testing with minio on CI
      • set up a real test bucket and user for e2e testing
      • update docs and vignettes

      Attachments

        Issue Links

          Activity

            People

              npr Neal Richardson
              npr Neal Richardson
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 50m
                  50m