Details
-
New Feature
-
Status: Closed
-
Major
-
Resolution: Invalid
-
None
-
None
-
None
Description
Parquet ArrayReader currently only supports reading an entire file from start to finish and does not allow selectively reading a subset of row groups. This prevents us from parallelizing work across threads when processing a single parquet file.
Attachments
Issue Links
- blocks
-
ARROW-10995 [Rust] [DataFusion] Improve parallelism when reading Parquet files
- Resolved
- is depended upon by
-
ARROW-11056 [Rust] [DataFusion] Allow ParquetExec to parallelize work based on row groups
- Closed