Details
-
Bug
-
Status: Open
-
Critical
-
Resolution: Unresolved
-
2.1.1
-
None
-
None
-
None
Description
I carried out a test as mentioned in my SO question
Gist of the test:
A broker's INPUT access is blocked. So it is not able to receive any messages.
But still it can send heartbeats to ZK, so that a leader election will not happen.
So any message produced to the partition lead by this zombie broker is never produced leaving the system in an unrecoverable state.
Possible resolution:
There should be a 2 way communication such that, if a broker is not able to have any INPUT access, the ZK MUST know of it by sending some ping messages to the brokers.
If there is no response from the broker, elect a new one. Since, if the broker is not ping-able by ZK, that broker is as good as dead for its purpose.