Details
-
Bug
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
-
None
-
None
Description
Problem:
At LinkedIn, we noticed several standalone users complained about lag/downtime during rolling deployments/upgrades.
Description:
During rolling upgrades, the current debounce timer gets extended every time when there is a quorum change notification. As a result, processors that were upgraded earlier in the deployment window remain unavailable waiting for work assignment. In some scenarios, this cause processors to be unavailable for 20 minutes or so depending on the size of the quorum and the debounce time configuration.
Impact:
Partitions that were stopped for initial processors as part of upgrade remain unassigned for the entire deployment window which can result in processing lag.
Attachments
Issue Links
- links to