Details
-
Improvement
-
Status: Closed
-
Major
-
Resolution: Fixed
-
None
Description
After upgrade the version to 0.14.0, the performance of the Spark job, which is written into a simple bucket index table, is regressing.
The reason is in the PR#4480, the refactor of bucket index introduce two unnecessary stages in tag for simple bucket index.
List<String> partitions = records.map(HoodieRecord::getPartitionPath).distinct().collectAsList();
Attachments
Attachments
Issue Links
- links to