Details
- Type: Improvement
- Status: Resolved
- Priority: Major
- Resolution: Fixed
Description
[Context: looking into using the datasets machinery in the current Python Parquet code.]
The current Python API exposes several options that influence how a Parquet file is read, e.g. read_dictionary (read the listed BYTE_ARRAY columns directly into a dictionary type), memory_map, and buffer_size.
These options could also be added to ParquetFileFormat.