Uploaded image for project: 'Kafka'
  1. Kafka
  2. KAFKA-1188

Stale LeaderAndIsr request could be handled by the broker on Controller failover

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • None
    • None
    • None

    Description

      ... which will cause the broker to truncate log as responding in makeFollower, and hence lose data.

      One procedure to produce this issue:

      3 brokers, 3 partitions, replication factor = 3.

      1. Broker 1 is the original controller.
      2. Broker 3 lost registration, 1 send LeaderAndIsr to 1 for isr shrinking (leader of partition 1), but not to 2 since it is not the leader.
      3. Broker 1 dies, new controller 2 sends all LeaderAndIsr to 2 and 3 (resumed), and 2 handles the makeFollower, truncates data.

      Attachments

        1. KAFKA-1188.patch
          15 kB
          Guozhang Wang
        2. KAFKA-1188_2013-12-24_11:07:12.patch
          13 kB
          Guozhang Wang
        3. KAFKA-1188_2014-01-02_10:43:54.patch
          13 kB
          Guozhang Wang
        4. KAFKA-1188_2014-01-15_16:43:30.patch
          14 kB
          Guozhang Wang
        5. KAFKA-1188_2014-01-16_11:06:56.patch
          16 kB
          Guozhang Wang
        6. KAFKA-1188_2014-01-16_15:02:22.patch
          17 kB
          Guozhang Wang
        7. KAFKA-1188_2014-01-21_14:01:22.patch
          17 kB
          Guozhang Wang
        8. KAFKA-1188_2014-02-11_17:48:06.patch
          17 kB
          Guozhang Wang
        9. KAFKA-1188_2014-02-11_17:50:06.patch
          17 kB
          Guozhang Wang

        Issue Links

          Activity

            People

              guozhang Guozhang Wang
              guozhang Guozhang Wang
              Votes:
              1 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: