Details
- Improvement
- Status: Resolved
- Minor
- Resolution: Fixed
- None
Description
In the Rust implementation of the Arrow RecordBatch-to-Parquet writer (3.0.0-SNAPSHOT), the ArrowWriter::write call can allocate more memory than it needs.
For a RecordBatch with m rows and n columns, ArrowWriter::write allocates m*n definition levels, which is m times the memory actually required.
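To give a feel for the scale, the arithmetic above can be sketched as follows. This is only a back-of-the-envelope illustration and does not call the parquet crate: the function names are hypothetical, the 2-byte (i16) width per definition level is an assumption, and the "required" figure is simply the allocated figure divided by m, as the description's ratio implies.

```rust
use std::mem::size_of;

/// Definition levels allocated for a batch, per the description: m * n.
/// (Hypothetical helper, not part of the parquet crate.)
fn allocated_levels(rows: usize, columns: usize) -> usize {
    rows * columns
}

/// Bytes used by those levels, ASSUMING each level is stored as an i16.
fn allocated_bytes(rows: usize, columns: usize) -> usize {
    allocated_levels(rows, columns) * size_of::<i16>()
}

/// Bytes implied as sufficient by the description's "m times" ratio,
/// i.e. the allocated bytes divided by the row count m.
/// Returns 0 for an empty batch to avoid dividing by zero.
fn implied_required_bytes(rows: usize, columns: usize) -> usize {
    if rows == 0 {
        0
    } else {
        allocated_bytes(rows, columns) / rows
    }
}

fn main() {
    // Example sizes chosen for illustration only.
    let (m, n) = (1_000, 10);
    println!("allocated levels: {}", allocated_levels(m, n));
    println!("allocated bytes:  {}", allocated_bytes(m, n));
    println!("implied required: {}", implied_required_bytes(m, n));
}
```

Because the overhead factor equals the row count, the excess grows linearly with batch size, so large batches are affected the most.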