[KAFKA-1355] Reduce/optimize update metadata requests sent during leader election - ASF JIRA

Details

Type: Bug
Status: Closed
Priority: Critical
Resolution: Fixed
Affects Version/s: 0.8.1
Fix Version/s: 0.8.1.1
Component/s: None
Labels:
None

Description

This is part of the investigation into slow shutdowns in 0.8.1. While
logging contributes to bulk of the regression, this one also adds
quite a bit of overhead:

In addLeaderAndIsrRequest (called for every partition that is led by the
broker being shut down) we also add an UpdateMetadataRequest - each call to
addUpdateMetadataRequests does two traversals over all (global)
partitions. I think it should be straightforward to optimize this a bit.

Marking as critical, since it is not as big an overhead as the logging.

Attachments

KAFKA-1355_2014-04-04_13:48:34.patch
04/Apr/14 20:48
16 kB
Joel Jacob Koshy
KAFKA-1355_2014-04-04_13:51:22.patch
04/Apr/14 20:51
16 kB
Joel Jacob Koshy
KAFKA-1355_2014-04-17_14:48:57.patch
17/Apr/14 21:49
16 kB
Joel Jacob Koshy
KAFKA-1355.patch
10/Apr/14 21:02
16 kB
Joel Jacob Koshy

Issue Links

Add Link

is depended upon by

KAFKA-1380 0.8.1.1 release candidate

Closed

Delete this link

Activity

Ascending order - Click to sort in descending order

Joel Jacob Koshy added a comment - 01/Apr/14 01:45

BTW, here are the configs and steps I used for this, ~~KAFKA-1342~~ and ~~KAFKA-1350~~:

Four brokers, 100 topics, eight partitions each.

Log4j and server configs by broker:
https://gist.github.com/anonymous/9906088
https://gist.github.com/anonymous/9906092
https://gist.github.com/anonymous/9906096
https://gist.github.com/anonymous/9906102
https://gist.github.com/anonymous/9906144
https://gist.github.com/anonymous/9906148
https://gist.github.com/anonymous/9906153
https://gist.github.com/anonymous/9906157

Producer performance: https://gist.github.com/anonymous/9906163

(At the end, just grep in the controller's request log and extract local time)
grep ControlledShutdownRequest logs/kafka-request*

Joel Jacob Koshy added a comment - 01/Apr/14 01:45 BTW, here are the configs and steps I used for this, KAFKA-1342 and KAFKA-1350 : Four brokers, 100 topics, eight partitions each. Log4j and server configs by broker: https://gist.github.com/anonymous/9906088 https://gist.github.com/anonymous/9906092 https://gist.github.com/anonymous/9906096 https://gist.github.com/anonymous/9906102 https://gist.github.com/anonymous/9906144 https://gist.github.com/anonymous/9906148 https://gist.github.com/anonymous/9906153 https://gist.github.com/anonymous/9906157 Producer performance: https://gist.github.com/anonymous/9906163 (At the end, just grep in the controller's request log and extract local time) grep ControlledShutdownRequest logs/kafka-request*

Joel Jacob Koshy added a comment - 04/Apr/14 18:23

https://reviews.apache.org/r/20038

Joel Jacob Koshy added a comment - 04/Apr/14 18:23 https://reviews.apache.org/r/20038

Joel Jacob Koshy added a comment - 04/Apr/14 20:48

Updated reviewboard https://reviews.apache.org/r/20038/
against branch origin/trunk

Joel Jacob Koshy added a comment - 04/Apr/14 20:48 Updated reviewboard https://reviews.apache.org/r/20038/ against branch origin/trunk

Joel Jacob Koshy added a comment - 04/Apr/14 20:51

Updated reviewboard https://reviews.apache.org/r/20038/
against branch origin/trunk

Joel Jacob Koshy added a comment - 04/Apr/14 20:51 Updated reviewboard https://reviews.apache.org/r/20038/ against branch origin/trunk

Joel Jacob Koshy added a comment - 09/Apr/14 18:01

Committed to trunk (including the comment fix in Jun's follow-up review).

Need patch for 0.8.1

Joel Jacob Koshy added a comment - 09/Apr/14 18:01 Committed to trunk (including the comment fix in Jun's follow-up review). Need patch for 0.8.1

Joel Jacob Koshy added a comment - 10/Apr/14 21:02

Created reviewboard https://reviews.apache.org/r/20232/
against branch origin/0.8.1

Joel Jacob Koshy added a comment - 10/Apr/14 21:02 Created reviewboard https://reviews.apache.org/r/20232/ against branch origin/0.8.1

Neha Narkhede added a comment - 12/Apr/14 20:10

Joel Jacob Koshy Should we also check in this patch to 0.8.1. I'm not sure if we waiting on something?

Neha Narkhede added a comment - 12/Apr/14 20:10 Joel Jacob Koshy Should we also check in this patch to 0.8.1. I'm not sure if we waiting on something?

Joel Jacob Koshy added a comment - 12/Apr/14 22:18

Tim's patch in 1363 conflicts with this. So we can get 1363 in first, and I will rebase this one.

Joel Jacob Koshy added a comment - 12/Apr/14 22:18 Tim's patch in 1363 conflicts with this. So we can get 1363 in first, and I will rebase this one.

Neha Narkhede added a comment - 12/Apr/14 23:46

~~KAFKA-1363~~ is in 0.8.1 as well as trunk now.

Neha Narkhede added a comment - 12/Apr/14 23:46 KAFKA-1363 is in 0.8.1 as well as trunk now.

Joel Jacob Koshy added a comment - 14/Apr/14 17:47

Sorry - I meant ~~KAFKA-1356~~, not 1363 - will check that in first after reviewing and rebase this.

Joel Jacob Koshy added a comment - 14/Apr/14 17:47 Sorry - I meant KAFKA-1356 , not 1363 - will check that in first after reviewing and rebase this.

Joel Jacob Koshy added a comment - 14/Apr/14 17:48

That has been marked as closed, but https://reviews.apache.org/r/20252/ has not been checked into 0.8.1

Joel Jacob Koshy added a comment - 14/Apr/14 17:48 That has been marked as closed, but https://reviews.apache.org/r/20252/ has not been checked into 0.8.1

Joel Jacob Koshy added a comment - 17/Apr/14 21:49

Updated reviewboard https://reviews.apache.org/r/20232/
against branch origin/0.8.1

Joel Jacob Koshy added a comment - 17/Apr/14 21:49 Updated reviewboard https://reviews.apache.org/r/20232/ against branch origin/0.8.1

Joel Jacob Koshy added a comment - 18/Apr/14 21:04

Thanks for the review. Committed to 0.8.1 as well.

Joel Jacob Koshy added a comment - 18/Apr/14 21:04 Thanks for the review. Committed to 0.8.1 as well.

Comment

Viewable by All Users

Cancel

People

Assignee:: Unassigned

Reporter:: Joel Jacob Koshy

Votes:: 1 Vote for this issue

Watchers:: 4
Start watching this issue

Dates

Created:: 01/Apr/14 01:25

Updated:: 27/May/14 18:42

Resolved:: 18/Apr/14 21:04

Agile

View on Board

Slack

Issue deployment

Kafka

Details

Description

Attachments

Attachments

Issue Links

Activity

People

Dates

Agile

Slack

Issue deployment