Details
- Type: Bug
- Status: Closed
- Priority: Blocker
- Resolution: Fixed
Description
Metadata Table v2 performs an inline compaction once a deltacommit has been written.
Timeline:
(on dataset) t1.commit.requested
(on dataset) t1.commit.inflight
---- all parquet writes complete here, WriteStatus generated ----
(on metadata table) t1.deltacommit.requested
(on metadata table) t1.deltacommit.inflight
(on metadata table) t1.deltacommit
---- deltacommit completed ----
(on metadata table) t1-001.compaction.requested
(on metadata table) t1-001.compaction.inflight
(on metadata table) t1-001.commit
If t1.commit then fails on the dataset, the metadata table has already merged information from t1 into its base files, and that information will be returned to readers. The metadata table reader logic validates only deltacommits against completed instants on the dataset timeline; it assumes that a base file is always sane.
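The failure mode can be illustrated with a toy model. This is only a sketch under stated assumptions, not Hudi code: ToyMetadataTable, delta_commit, compact, list_files and run_scenario are hypothetical names, and the "base file" is reduced to a set of file names that no longer remembers which instant added them.

```python
from dataclasses import dataclass, field


@dataclass
class ToyMetadataTable:
    """Hypothetical stand-in for the metadata table: a base file plus log records."""
    # Files already merged into the base file; the originating instant is lost.
    base_files: set = field(default_factory=set)
    # Uncompacted records as (dataset instant time, file name) pairs.
    log_records: list = field(default_factory=list)

    def delta_commit(self, instant, files):
        """Record the files written by a dataset instant as log records."""
        self.log_records.extend((instant, f) for f in files)

    def compact(self):
        """Merge every log record into the base file, ignoring dataset commit state."""
        self.base_files.update(f for _, f in self.log_records)
        self.log_records.clear()

    def list_files(self, completed_instants):
        """Reader: filter log records by completed dataset instants, trust the base file."""
        visible = set(self.base_files)
        visible.update(
            f for instant, f in self.log_records if instant in completed_instants
        )
        return visible


def run_scenario(compact_inline):
    table = ToyMetadataTable()
    completed = set()  # t1.commit never completes on the dataset
    table.delta_commit("t1", ["f1.parquet"])
    if compact_inline:
        table.compact()  # corresponds to t1-001 in the timeline above
    return table.list_files(completed)


print(run_scenario(compact_inline=False))  # set()
print(run_scenario(compact_inline=True))   # {'f1.parquet'}
```

Without the inline compaction, the reader filters out the record from the failed t1. With it, the same record has moved into the base file and is returned even though t1 never completed on the dataset.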
Issue Links
- is duplicated by
- HUDI-2869 Metadata bootstrapping should ignore data files from partial/inflight commits (Closed)
- links to