Details
-
Bug
-
Status: Closed
-
Blocker
-
Resolution: Fixed
-
None
-
None
Description
Currently, invoking into UDF w/in Hudi's Bulk Insert causes 20% perf-gap as compared against raw Parquet Bulk Insert into a non-partitioned table.
Attachments
Issue Links
- fixes
-
HUDI-4374 Support BULK_INSERT row-writing on streaming Dataset/DataFrame
- Closed
- relates to
-
HUDI-4365 Bulk Insert not URL encoding Partition Path properly
- Closed
-
HUDI-3995 Bulk insert row writer perf improvements
- Closed
-
HUDI-4036 Investigate whether meta fields could be omitted completely
- Open
- links to