Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
Description
As a follow up to PARQUET-1698 and ARROW-7995, we should use the I/O coalescing facility (where available and enabled), in combination with a column filter hint, to compute and prefetch the exact byte ranges we will be reading (using the metadata). This should further improve performance on remote object stores like Amazon S3.
Attachments
Issue Links
- Dependent
-
PARQUET-1698 [C++] Add reader option to pre-buffer entire serialized row group into memory
- Resolved
- is duplicated by
-
PARQUET-1698 [C++] Add reader option to pre-buffer entire serialized row group into memory
- Resolved
- is related to
-
ARROW-11601 [C++][Dataset] Expose pre-buffering in ParquetFileFormatReaderOptions
- Resolved
- relates to
-
ARROW-7995 [C++] IO: coalescing and caching read ranges
- Resolved
- links to