Details
-
Bug
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
Description
parquet/column_writer.cc WriteArrow implementations at certain points checks null counts/required data and passes through the null bitmap for encoding. This only works for nested data types if the if the null slot on a parent implies a null slot on the leaf. This relationship is not required by the specifications.
Most paths for creating arrays follow this pattern so it would be esoteric to hit this bug, but we should still fix it.
All branches that rely on reading nullness should generate a new null bitmap based on definition levels if the column is nested, and decisions should be based off of that.
Attachments
Issue Links
- links to