Details
- Type: Improvement
- Status: Patch Available
- Priority: Critical
- Resolution: Unresolved
Description
Do a global search for usages of these APIs:
- org.apache.hudi.common.engine.HoodieEngineContext#flatMap
- org.apache.hudi.common.engine.HoodieEngineContext#map
These and similar methods take a parallelism argument.
Many call sites pass the number of input items as the parallelism, which hurts performance. Parallelism should instead be based on the number of cores available in the cluster and set by the user via the parallelism configs.
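One possible shape for the fix, as a minimal standalone sketch: cap the parallelism at a user-configured value instead of passing the item count directly. The class name `ParallelismSketch`, the helper `boundedParallelism`, and the value 200 are hypothetical and used only for illustration. They are not existing Hudi APIs or defaults. The actual change would read the relevant parallelism config at each call site.

```java
import java.util.Collections;
import java.util.List;

// Hypothetical illustration only; not part of the Hudi codebase.
final class ParallelismSketch {
    private ParallelismSketch() {}

    /**
     * Returns the parallelism to pass to a map/flatMap style call:
     * never more than the configured value, never more than the
     * number of items, and at least 1 so an empty input is still valid.
     */
    static int boundedParallelism(int numItems, int configuredParallelism) {
        if (configuredParallelism < 1) {
            throw new IllegalArgumentException("configuredParallelism must be >= 1");
        }
        return Math.max(1, Math.min(numItems, configuredParallelism));
    }

    public static void main(String[] args) {
        // Assumed value standing in for a user-set parallelism config.
        int configured = 200;

        // 100,000 items: using the item count directly would request 100,000 tasks.
        List<String> items = Collections.nCopies(100_000, "partition");
        System.out.println(boundedParallelism(items.size(), configured)); // prints 200

        // Small inputs still get no more tasks than there are items.
        System.out.println(boundedParallelism(3, configured)); // prints 3

        // Empty input falls back to a single task.
        System.out.println(boundedParallelism(0, configured)); // prints 1
    }
}
```

A call site that currently passes `list.size()` as the parallelism argument would pass the bounded value instead, so large inputs no longer produce one task per item.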
Issue Links
- is related to HUDI-712 Improve exporter performance and memory usage (Closed)