  1. Kafka
  2. KAFKA-3083

A soft failure in the controller may leave a topic partition in an inconsistent state


    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s:
    • Fix Version/s: 1.1.0
    • Component/s: core
    • Labels:


      The following sequence can happen.

      1. Broker A is the controller and is in the middle of processing a broker change event. As part of this process, suppose it is about to shrink the ISR of a partition.

      2. Broker A's ZooKeeper session then expires and broker B takes over as the new controller. Broker B sends the initial leaderAndIsr request to all brokers.

      3. Broker A continues: it shrinks the ISR of the partition in ZK and sends the new leaderAndIsr request to the broker (say C) that leads the partition. Broker C rejects this request because it comes from a controller with an older epoch. We can now end up in a state where broker C thinks the ISR contains all replicas, but the ISR stored in ZK is different.
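
      The three steps above can be replayed with a small toy model. This is only a sketch for illustration, not Kafka code: `ZkStore`, `LeaderBroker`, `replay_race`, and the replica names C, D, E are all hypothetical, and the model assumes the stale controller's ZK write is not guarded by any epoch check, as the description implies.

```python
from dataclasses import dataclass, field


@dataclass
class ZkStore:
    """Toy stand-in for ZooKeeper: controller epoch plus the ISR of one partition."""
    controller_epoch: int = 1
    isr: list = field(default_factory=lambda: ["C", "D", "E"])


@dataclass
class LeaderBroker:
    """Toy stand-in for broker C, the leader of the partition."""
    known_controller_epoch: int = 1
    isr: list = field(default_factory=lambda: ["C", "D", "E"])

    def handle_leader_and_isr(self, controller_epoch, isr):
        # A request from a controller with an older epoch is rejected.
        if controller_epoch < self.known_controller_epoch:
            return False
        self.known_controller_epoch = controller_epoch
        self.isr = list(isr)
        return True


def replay_race():
    zk = ZkStore()
    leader = LeaderBroker()

    # Step 1: controller A (epoch 1) is processing a broker change event and
    # has decided to drop replica D from the ISR, but has not written it yet.
    epoch_of_a = zk.controller_epoch
    shrunk_isr = [r for r in zk.isr if r != "D"]

    # Step 2: A's session expires. B becomes controller with a higher epoch
    # and sends the initial leaderAndIsr request built from the state in ZK.
    zk.controller_epoch += 1
    assert leader.handle_leader_and_isr(zk.controller_epoch, zk.isr)

    # Step 3: A, unaware that it was replaced, writes the shrunk ISR to ZK
    # (assumed unguarded in this model) and notifies the leader with its old epoch.
    zk.isr = shrunk_isr
    accepted = leader.handle_leader_and_isr(epoch_of_a, shrunk_isr)

    return zk, leader, accepted


if __name__ == "__main__":
    zk, leader, accepted = replay_race()
    print("stale request accepted:", accepted)
    print("ISR in ZK:     ", zk.isr)
    print("ISR on leader: ", leader.isr)
```

      In this model the leader keeps the full ISR while ZK holds the shrunk one, which is the inconsistency described in step 3.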





              • Assignee:
                onurkaraman Onur Karaman
                junrao Jun Rao
              • Votes: 8
              • Watchers: 26


                • Created: