Details
-
Bug
-
Status: Closed
-
Blocker
-
Resolution: Fixed
-
None
-
None
Description
There currently following issues w/ the current HoodieSparkRecord implementation:
- It rewrites records using `rewriteRecord` and `rewriteRecordWithNewSchema` which do Schema traversals for every record. Instead we should do schema traversal only once and produce a transformer that will directly create new record from the old one.
- Records are currently copied for every Executor even for Simple one which actually is not buffering any records and therefore doesn't require records to be copied.
Attachments
Issue Links
- is duplicated by
-
HUDI-5641 Streamline Advanced Schema Evolution flow
-
- Closed
-
- links to