Details
-
Improvement
-
Status: Resolved
-
Blocker
-
Resolution: Resolved
-
1.13.0
-
None
Description
In 1.12, we introduced sort-merge blocking shuffle to Flink and in 1.13, the feature was optimized which improves the usability (fix direct memory OOM issue) and performance (introduce IO scheduling and broadcast optimization).
The sort-merge blocking shuffle can be tested following the bellow process:
- Write a simple batch job using either sql/table or DataStream API; (Word count should be enough)
- Enable sort-merge blocking shuffle by setting taskmanager.network.sort-shuffle.min-parallelism to 1 in the Flink configuration file;
- Submit and run the batch job with different parallelism and data volume;
- Tune the relevant config options (taskmanager.network.blocking-shuffle.compression.enabled, taskmanager.network.sort-shuffle.min-buffers, taskmanager.memory.framework.off-heap.batch-shuffle.size) and see the influence.
Attachments
Issue Links
- is related to
-
FLINK-21967 Add documentation on the operation of blocking result partition
- Closed
-
FLINK-19614 Further optimization of sort-merge based blocking shuffle
- Closed
1.
|
Document sort-merge blocking shuffle | Closed | Unassigned | |
2.
|
Improve log messages of sort-merge blocking shuffle | Closed | Yingjie Cao |