Details
-
Sub-task
-
Status: Open
-
Major
-
Resolution: Unresolved
-
1.14.0
-
None
-
None
Description
given:
The job with 5 maps(with keyBy).
All channels are remote. Parallelism is 80
The first task produces only two keys - `indexOfThisSubtask` and `indexOfThisSubtask + 1`. So every subTask has a constant value of active channels(depends on hash rebalance)
Every record has an equal size and is processed for an equal time.
when:
The buffer debloat is enabled with the default configuration.
then:
The buffer size synchonizes on every subTask on the first map for some reason. It can have the strong synchronization as shown on the erraticBufferSize1 picture but usually synchronization is less explicit as on erraticBufferSize2.
Expected:
After the stabilization period the buffer size should be mostly constant with small fluctuation or the different tasks should be in antiphase to each other(when one subtask has small buffer size the another should have a big buffer size). for example the picture antiphaseBufferSize
Unfortunatelly, it is not reproduced every time which means that this problem can be connected to environment. But at least, it makes sense to try to understand why we have so strange load shape when only several input channels are active.
Attachments
Attachments
Issue Links
- Dependent
-
FLINK-23974 Decreased throughput with enabled buffer debloat
- Closed