Details
- Type: Improvement
- Status: Resolved
- Priority: Major
- Resolution: Fixed
- Version: 1.0.0
Description
Currently, format-specific scan options are embedded as members of the corresponding subclass of FileFormat. Extracting them into an options struct would provide better separation of concerns: at present, the only way to scan a Parquet-formatted dataset with different options is to reconstruct the dataset from its component files using a differently configured format.
CsvFileFormat could retain ParseOptions as a member, since (for example) tab-separated vs comma-separated values can justifiably be considered different formats.
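The proposed separation can be illustrated with a minimal sketch. It is written in Python for brevity and is not the Arrow API: the classes below only borrow names from this issue and its links, and the fields `use_buffered_stream` and `buffer_size`, the `Dataset` wrapper, and the helper `resolve_scan_options` are assumptions made for illustration. The idea shown is that the format carries default options, while each scan may supply its own options without rebuilding the dataset.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class FragmentScanOptions:
    """Hypothetical base class for per-scan, format-specific options."""
    type_name: str = "default"


@dataclass(frozen=True)
class ParquetFragmentScanOptions(FragmentScanOptions):
    # Field names are illustrative assumptions, not the real Arrow members.
    type_name: str = "parquet"
    use_buffered_stream: bool = False
    buffer_size: int = 8192


@dataclass(frozen=True)
class FileFormat:
    """Describes the format itself; scan options are only a default here."""
    type_name: str
    default_scan_options: Optional[FragmentScanOptions] = None


@dataclass(frozen=True)
class Dataset:
    format: FileFormat
    files: Tuple[str, ...]


def resolve_scan_options(dataset, override=None):
    """Return the options for one scan: the override if given, else the format default."""
    options = override if override is not None else dataset.format.default_scan_options
    if options is not None and options.type_name != dataset.format.type_name:
        raise ValueError(
            f"{options.type_name} options cannot be used with a "
            f"{dataset.format.type_name} dataset"
        )
    return options


# The dataset is built once, with default options attached to its format.
dataset = Dataset(
    FileFormat("parquet", ParquetFragmentScanOptions()),
    ("a.parquet", "b.parquet"),
)

# A later scan supplies different options; the dataset itself is not rebuilt.
buffered = ParquetFragmentScanOptions(use_buffered_stream=True, buffer_size=1 << 20)
default_opts = resolve_scan_options(dataset)
custom_opts = resolve_scan_options(dataset, buffered)
```

In this sketch the same `dataset` object serves both scans, which is the behavior the current design cannot offer. Rejecting options whose type does not match the format is one possible design choice, not something this issue prescribes.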
Issue Links
- is related to
  - ARROW-11974 [GLib] Add CsvFragmentScanOption support (Open)
  - ARROW-12059 [R] Accept format-specific scan options in collect() (Open)
  - ARROW-11972 [C++][Dataset] Extract IpcFragmentScanOptions, ParquetFragmentScanOptions (Resolved)
- relates to
  - ARROW-8631 [C++][Dataset] Add ConvertOptions and ReadOptions to CsvFileFormat (Resolved)
- links to