[FLINK-34567] flink task manager error occur, msg: Encountered error while consuming partitions - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Open
Priority: Major
Resolution: Unresolved
Affects Version/s: 1.16.2
Fix Version/s: None
Component/s: None
Labels:
- flink

Language:
- java

Description

I deploy flink cluster (version: 1.16.2) and it run normally about 2 months, but recently i meet a problem. I see some sub tasks back pressure is high and the flink job is totally blocked(in pic1.jpg), these sub tasks are all in one task manager. so i stop the abnormal task manager and deploy flink job again, the problem is solved. I find some error log in the abnormal task manager:

2024-03-03 15:57:25,088 ERROR org.apache.flink.runtime.io.network.netty.PartitionRequestQueue [] - Encountered error while consuming partitions
org.apache.flink.shaded.netty4.io.netty.channel.unix.Errors$NativeIoException: readAddress(..) failed: Connection timed out

I check the abnormal task manager deployed machine. cpu, memory, network is as normal as other task manager deployed machine, so it doesn't look like a hardware problem.

What does it mean?

What should i do to solve this problem completely?

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

pic1.jpg
04/Mar/24 03:38
202 kB
yamanda

Activity

People

Assignee:: Unassigned

Reporter:: yamanda

Votes:: 0 Vote for this issue

Watchers:: 1 Start watching this issue

Dates

Created:: 04/Mar/24 03:49

Updated:: 04/Mar/24 06:20

Time Tracking

Estimated:

96h

Remaining:

96h

Logged:

Not Specified