Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
Description
Dremio uses 64K batch sizes. We could probably get away with even larger batch sizes (e.g. 256K or 1M) and allow memory-constrained users to elect a smaller batch size.
See example of some performance issues related to this in ARROW-9924
Attachments
Issue Links
- is fixed by
-
ARROW-9924 [Python] Performance regression reading individual Parquet files using Dataset interface
- Resolved
- relates to
-
ARROW-9924 [Python] Performance regression reading individual Parquet files using Dataset interface
- Resolved