Apache Arrow / ARROW-1213

[Python] Enable s3fs to be used with ParquetDataset and reader/writer functions

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.6.0
    • Component/s: Python

    Description

      The pyarrow dataset function (ParquetDataset) can't read from S3 when s3fs is used as the filesystem. Can we add support for reading partitioned Parquet files from S3?

      I am trying to address the problem described in this Stack Overflow question:
      https://stackoverflow.com/questions/45082832/how-to-read-partitioned-parquet-files-from-s3-using-pyarrow-in-python

          People

            Assignee: Wes McKinney (wesm)
            Reporter: Yacko (yackoa)
            Votes: 0
            Watchers: 5

              Time Tracking

                Original Estimate: Not Specified
                Remaining Estimate: 0h
                Time Spent: 20m