Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
Description
I would like to enhance partition filters in methods such as:
pyarrow.parquet.ParquetDataset(path, filters)
I am proposing the below enhancements:
- for operator "in", "not in", the value should be any typing.Iteratable (also a container). But currently only set is supported while other iteratable, such as list, tuple cannot function correctly. I would like to change it to accept any iteratable.
- Enhance the documents about the partition filters.
I see there is a new version implemented with
_ParquetDatasetV2 which already accepts an iterable. So the documentation update is fine for the new version as well.
Attachments
Issue Links
- links to