Details
-
Improvement
-
Status: Resolved
-
Minor
-
Resolution: Fixed
-
None
Description
Currently reading parquet files generated by Hadoop (EMR) from S3 fails with "ValueError: Found files in an intermediate directory" because of the _$folder$ empty files.
The fix should be easy, just an extra condition in ParquetManifest._should_silently_exclude.
Attachments
Issue Links
- links to