Details
- Bug
- Status: Closed
- Blocker
- Resolution: Fixed
- 0.12.0
- None
Description
While troubleshooting a duplicates issue with Abhishek Modi from Notion, we found that the min/max record key stats are currently being persisted incorrectly in the Parquet metadata, which leads to duplicate records being produced in their pipeline after the initial bulk-insert.
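To illustrate why incorrect min/max key stats can surface as duplicates, here is a minimal toy sketch. It is not Hudi's code: `BaseFile`, `candidate_files` and `tag_record` are hypothetical names, and the specific corruption shown (a max key recorded too small) is only an assumed example, since the ticket does not say how the stats were wrong. The sketch models an index lookup that first prunes files by their stored key range and only then checks for the key, so a wrong range makes an existing key look new.

```python
from dataclasses import dataclass


@dataclass
class BaseFile:
    """Toy stand-in for a Parquet base file plus its footer key-range stats."""
    name: str
    keys: set       # record keys actually stored in the file
    min_key: str    # min record key as persisted in the footer (may be wrong)
    max_key: str    # max record key as persisted in the footer (may be wrong)


def candidate_files(files, key):
    """Range pruning: keep only files whose [min_key, max_key] covers the key."""
    return [f for f in files if f.min_key <= key <= f.max_key]


def tag_record(files, key):
    """Return 'update' if the key is found in a candidate file, else 'insert'."""
    for f in candidate_files(files, key):
        if key in f.keys:
            return "update"
    return "insert"


keys = {"key_001", "key_050", "key_100"}

# Footer stats that match the file contents.
correct = BaseFile("f1.parquet", keys, min(keys), max(keys))

# Hypothetical corruption: the persisted max key is smaller than the real one.
broken = BaseFile("f1.parquet", keys, "key_001", "key_050")

print(tag_record([correct], "key_100"))  # update
print(tag_record([broken], "key_100"))   # insert
```

In the second call the file is pruned before its contents are checked, so a record that already exists is tagged as an insert and would be written a second time by the following upsert.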
Issue Links
- is a parent of: HUDI-5051 Add a functional regression test for Bloom Index followed on w/ Upserts (Closed)
- links to