Kafka / KAFKA-1461

Replica fetcher thread does not implement any back-off behavior

    Details

    • Type: Improvement
    • Status: Patch Available
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 0.8.1.1
    • Fix Version/s: 0.8.3
    • Component/s: replication
    • Labels:

      Description

      The current replica fetcher thread will retry in a tight loop if any error occurs during the fetch call. For example, we've seen cases where the fetch continuously throws a connection-refused exception, leaving several replica fetcher threads spinning in a very tight loop.

      To a much lesser degree this is also an issue in the consumer fetcher thread, although the fact that erroring partitions are removed, so that a leader can be re-discovered, helps somewhat.

      1. KAFKA-1461.patch
        19 kB
        Sriharsha Chintalapani

        Issue Links

          Activity

          Nicolae Marasoiu added a comment -

          Hi,

          So I guess in this block:

            try {
              trace("Issuing to broker %d of fetch request %s".format(sourceBroker.id, fetchRequest))
              response = simpleConsumer.fetch(fetchRequest)
            } catch {
              case t: Throwable =>
                if (isRunning.get) {
                  warn("Error in fetch %s. Possible cause: %s".format(fetchRequest, t.toString))
                  partitionMapLock synchronized {
                    partitionsWithError ++= partitionMap.keys
                  }
                }
            }
          Should I add a case for the specific scenario of connection timeout/refused/reset and introduce a back-off on that path?
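
          For illustration, a minimal sketch of that idea (not the committed patch): the connection-level failures get their own case ahead of the generic handler, and fetchBackoffMs is an assumed configuration value. The catch block above would become something like:

            catch {
              // Hypothetical: back off on connection-level failures instead of retrying immediately.
              case e @ (_: java.net.ConnectException | _: java.net.SocketTimeoutException) =>
                if (isRunning.get) {
                  warn("Connection error in fetch %s. Possible cause: %s. Backing off %d ms.".format(fetchRequest, e.toString, fetchBackoffMs))
                  partitionMapLock synchronized {
                    partitionsWithError ++= partitionMap.keys
                  }
                  Thread.sleep(fetchBackoffMs) // fetchBackoffMs: assumed setting, e.g. 1000 ms
                }
              case t: Throwable =>
                // Existing handling for all other errors.
                if (isRunning.get) {
                  warn("Error in fetch %s. Possible cause: %s".format(fetchRequest, t.toString))
                  partitionMapLock synchronized {
                    partitionsWithError ++= partitionMap.keys
                  }
                }
            }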

          Guozhang Wang added a comment -

          I did not realize this ticket existed, and created the same one here (KAFKA-1629). It has a more detailed explanation of the issue, though.

          Joe Stein added a comment -

          Nicolae Cismaru, are you working on this patch? If not, can we set it back to unassigned so that someone else can jump in and fix it? It sure is annoying when it happens (for example, while waiting on "Recovering unflushed segment"): during that time, every replica fetching from that broker spews ERROR kafka.server.ReplicaFetcherThread.

          Nicolae Marasoiu added a comment -

          I agree to give it to someone else; I have not made progress on this yet.

          Thank you.


          Sriharsha Chintalapani added a comment -

          Joe Stein, Nicolae Marasoiu: I can take this. I am looking at this code for another JIRA.

          Sriharsha Chintalapani added a comment - edited

          Guozhang Wang, I had the following code in mind for back-off retries in case of any error. This code would live in ReplicaFetcherThread.handlePartitionsWithErrors.
          I am thinking of maintaining two maps in ReplicaFetcherThread:
          private val partitionsWithErrorStandbyMap = new mutable.HashMap[TopicAndPartition, Long] // a (topic, partition) -> offset
          private val partitionsWithErrorMap = new mutable.HashMap[TopicAndPartition, Long] // a (topic, partition) -> timestamp
          one for the offset and one for the timestamp.
          Remove the partitions from AbstractFetcherThread.partitionMap and add them back once currentTime > partitionsWithErrorMap.timestamp + replicaFetcherRetryBackoffMs.
          I am not quite sure about maintaining these two maps. If this looks OK to you, I'll send a patch; if you have another approach in mind, please let me know.

            def handlePartitionsWithErrors(partitions: Iterable[TopicAndPartition]) {

              // Record the failure time and the last fetched offset for newly erroring partitions.
              for (partition <- partitions) {
                if (!partitionsWithErrorMap.contains(partition)) {
                  partitionsWithErrorMap.put(partition, System.currentTimeMillis())
                  currentOffset(partition) match {
                    case Some(offset: Long) => partitionsWithErrorStandbyMap.put(partition, offset)
                    case None => // partition is no longer tracked; nothing to stash
                  }
                }
              }
              removePartitions(partitions.toSet)
              val partitionsToBeAdded = new mutable.HashMap[TopicAndPartition, Long]
              // Walk partitionsWithErrorMap and re-add partitions whose back-off time has elapsed.
              partitionsWithErrorMap.foreach {
                case (topicAndPartition, timeMs) =>
                  if (System.currentTimeMillis() > timeMs + brokerConfig.replicaFetcherRetryBackoffMs) {
                    partitionsWithErrorStandbyMap.get(topicAndPartition) match {
                      case Some(offset: Long) => partitionsToBeAdded.put(topicAndPartition, offset)
                      case None => // no stashed offset for this partition
                    }
                    partitionsWithErrorStandbyMap.remove(topicAndPartition)
                  }
              }
              addPartitions(partitionsToBeAdded)
            }
          
          Guozhang Wang added a comment -

          Sriharsha Chintalapani, sorry for the late reply.

          This fix looks good to me overall, except that we cannot add partitions back only in the handlePartitionsWithErrors() call, since that is only triggered when the next error happens. We can probably move this piece of code to processPartitionData().

          Another way to do this could be (a rough sketch follows the list below):

          1. Make the partitionMap in AbstractFetcherThread a map from TopicAndPartition to OffsetAndState, where OffsetAndState contains the offset (Long) and the state (active, inactive-with-delay). For simplicity we can just use an Int here: "active" would be 0, and inactive would be the delay time.

          2. Add another function called "delayPartitions" to AbstractFetcherThread, which sets the state to inactive with the delay time.

          3. In AbstractFetcherThread.doWork(), only include partitions with state 0 in the fetch request, and also update the state values for non-zero partitions.

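          A minimal sketch of that alternative, assuming it lives inside AbstractFetcherThread and using illustrative names (OffsetAndState, delayPartitions, eligiblePartitions) rather than the actual patch:

            // Hypothetical members added to AbstractFetcherThread; not the committed code.
            case class OffsetAndState(offset: Long, var delayMs: Int) {
              def isActive: Boolean = delayMs == 0
            }

            // partitionMap would become: mutable.HashMap[TopicAndPartition, OffsetAndState]

            // Mark the given partitions inactive for delayMs milliseconds.
            def delayPartitions(partitions: Iterable[TopicAndPartition], delayMs: Int) {
              partitionMapLock synchronized {
                for (partition <- partitions)
                  partitionMap.get(partition).foreach(state => state.delayMs = delayMs)
              }
            }

            // Called from doWork(): age the delays and return only the partitions that
            // should go into the next fetch request.
            def eligiblePartitions(elapsedMs: Int): Map[TopicAndPartition, Long] = {
              partitionMapLock synchronized {
                partitionMap.values.foreach { state =>
                  if (!state.isActive) state.delayMs = math.max(0, state.delayMs - elapsedMs)
                }
                partitionMap.collect { case (tp, state) if state.isActive => tp -> state.offset }.toMap
              }
            }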
          Idcmp added a comment - edited

          This issue can be tickled on a multi-broker configuration by having brokers advertise host names that do not exist (say, for example, you're running Kafka in Docker containers with custom hostnames).
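
          For illustration, a hypothetical broker configuration that reproduces this (the hostname is made up):

            # server.properties (hypothetical example): other brokers fetch from the
            # advertised name; if it does not resolve, their replica fetcher threads
            # fail every fetch and spin without back-off.
            advertised.host.name=kafka-1.docker.internal
            advertised.port=9092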

          Sriharsha Chintalapani added a comment -

          Created reviewboard https://reviews.apache.org/r/31366/diff/
          against branch origin/trunk

          Sriharsha Chintalapani added a comment -

          Guozhang Wang, thanks for the pointers. Can you please take a look at the patch when you get a chance?


            People

            • Assignee:
              Sriharsha Chintalapani
              Reporter:
              Sam Meder
            • Votes:
              2
              Watchers:
              6
