[FLINK-19898] [Kinesis][EFO] Ignore ReadTimeoutException from SubcribeToShard retry policy - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 1.12.0
Component/s: Connectors / Kinesis
Labels:
- pull-request-available

Description

Background

The Flink Kinesis EFO consumer has a SubscribeToShard retry policy which will terminate the job after a given number of subsequent attempt failures. In high backpressure scenarios the Netty HTTP Client throws a ReadTimeoutException when the consumer takes longer than 30s to process a batch. If this happens (by default) 10 times in a row, the job will terminate. There is no need to terminate in this condition, and the restart results in the job falling further behind.

Scope

Exclude the ReadTimeoutException from the SubscribeToShard retry policy, such that that connector will gracefully reconnect once the consumer has processed the queued records.

Attachments

Issue Links

links to

GitHub Pull Request #13886

Activity

People

Assignee:: Danny Cranmer

Reporter:: Danny Cranmer

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 30/Oct/20 10:21

Updated:: 04/Nov/20 10:42

Resolved:: 04/Nov/20 10:42