  1. Apache Arrow
  2. ARROW-10921

`TypeError: 'coroutine' object is not iterable` when reading parquet partitions via s3fs >= 0.5 with pyarrow


Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 3.0.0
    • Component/s: Python
    • Labels: None

    Description

      Trying to read partitioned parquet files with an updated s3fs >= 0.5 (used via dask), I got this error:

       

          410         files = set()
          411
      --> 412         for key in list(self.fs._ls(path, refresh=refresh)):
          413             path = key['Key']
          414             if key['StorageClass'] == 'DIRECTORY':

      TypeError: 'coroutine' object is not iterable
      
      

      The error comes from https://github.com/apache/arrow/blob/9baa123ea38ee9cc1d3a90cfc9347239cd28064c/python/pyarrow/filesystem.py#L415

       

      This seems related to s3fs switching to asyncio in https://github.com/dask/s3fs/pull/336.
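
      The failure mode can be reproduced without S3 access. The sketch below is only an illustration: `FakeAsyncFS` is a hypothetical stand-in, not the real s3fs class, and the synchronous `ls` wrapper shown here is an assumption about how an asyncio-based filesystem might expose a blocking API. It shows that calling an `async def` method without awaiting it returns a coroutine object, which `list()` cannot iterate.

```python
import asyncio


class FakeAsyncFS:
    """Hypothetical stand-in for an asyncio-based filesystem (not the real s3fs)."""

    async def _ls(self, path, refresh=False):
        # Listing entries shaped like the ones used in the traceback.
        return [
            {"Key": f"{path}/part=0", "StorageClass": "DIRECTORY"},
            {"Key": f"{path}/data.parquet", "StorageClass": "STANDARD"},
        ]

    def ls(self, path, refresh=False):
        # Assumed synchronous wrapper: runs the coroutine to completion.
        return asyncio.run(self._ls(path, refresh=refresh))


fs = FakeAsyncFS()

# Calling the async method directly yields a coroutine object, so list()
# fails in the same way as the report describes.
coro = fs._ls("bucket/table")
try:
    list(coro)
    error_message = None
except TypeError as exc:
    error_message = str(exc)
finally:
    coro.close()  # suppress the "coroutine was never awaited" warning

# Going through the synchronous wrapper returns an ordinary list.
keys = [entry["Key"] for entry in fs.ls("bucket/table")]
```

      Under these assumptions, code that called the private `_ls` as if it were synchronous breaks as soon as that method becomes a coroutine function, while callers of a blocking wrapper keep working.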

            People

              Assignee: Unassigned
              Reporter: Ivan Necas (inecas)