Details
-
New Feature
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
Description
Parquet writers like delta-rs rely on the Parquet metadata to write file-level statistics for file pruning purposes.
We currently do not expose these stats, requiring the writer to read the file that has just been written, to get the stats. This is more problematic for in-memory sinks, as there is currently no way of getting the metadata from the sink before it's persisted.
Explore if we can expose these stats to the writer, to make the above easier.
Attachments
Issue Links
- links to