Details
-
Improvement
-
Status: Open
-
P3
-
Resolution: Unresolved
-
None
-
None
-
None
Description
Parquet API allows block size support, which can improve IO performance when working with Parquet files. Currently, the ParquetIO does not support it at all so it looks like a room for improvement for this IO.
Good intro into this topic: https://www.dremio.com/tuning-parquet/
Attachments
Issue Links
- is duplicated by
-
BEAM-10842 Record level progress tracking for BlockTracker
- Resolved
- relates to
-
BEAM-214 Create Parquet IO
- Resolved