Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
Description
Directory-based partitioning is a feature of Arrow, but could we support filename-based partitioning?
e.g. I have a series of CSV files here all called something like foo_month_year.csv and it'd be nice to be able to read them in and then the month/year bits of the filenames then appear as fields I can filter on etc.
Attachments
Issue Links
- is related to
-
ARROW-15260 [R] open_dataset - add file_name as column
- Resolved
-
ARROW-16080 [R][Documentation] Document filename-based partitioning and filename-as-variable functionality
- Open
-
ARROW-15281 [C++] Implement ability to retrieve fragment filename
- Resolved
- relates to
-
ARROW-16113 [Python] Partitioning.dictionaries in case of a subset of fields are dictionary encoded
- Resolved
-
ARROW-14176 [Python] Filename-based partitioning scheme
- Open
- links to