Uploaded image for project: 'Kafka'
  1. Kafka
  2. KAFKA-228

Reduce duplicate messages served by the kafka consumer for uncompressed topics

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.7
    • Fix Version/s: 0.7.1
    • Component/s: None
    • Labels:
      None

      Description

      Kafka guarantees at-least once delivery of messages.The high level consumer provides highly available partitioned consumption of data within the same consumer group. In the event of broker failures or consumer failures within a group, the high level consumer rebalances and redistributes the topic partitions evenly amongst the consumers in a group. With the current design, during this rebalancing operation, Kafka introduces duplicates in the consumed data.

      This JIRA improves the rebalancing operation and the consumer iterator design to guarantee 0 duplicates while consuming uncompressed topics. There will be a small number of duplicates while serving compressed data, but it will be bound by the compression batch size.

        Attachments

        1. kafka-228-v3.patch
          21 kB
          Neha Narkhede
        2. kafka-228_v2.patch
          5 kB
          Jun Rao
        3. kafka-228.patch
          23 kB
          Neha Narkhede

          Issue Links

            Activity

              People

              • Assignee:
                nehanarkhede Neha Narkhede
                Reporter:
                nehanarkhede Neha Narkhede
              • Votes:
                0 Vote for this issue
                Watchers:
                1 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: