Uploaded image for project: 'Kafka'
  1. Kafka
  2. KAFKA-228

Reduce duplicate messages served by the kafka consumer for uncompressed topics

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 0.7
    • 0.7.1
    • None
    • None

    Description

      Kafka guarantees at-least once delivery of messages.The high level consumer provides highly available partitioned consumption of data within the same consumer group. In the event of broker failures or consumer failures within a group, the high level consumer rebalances and redistributes the topic partitions evenly amongst the consumers in a group. With the current design, during this rebalancing operation, Kafka introduces duplicates in the consumed data.

      This JIRA improves the rebalancing operation and the consumer iterator design to guarantee 0 duplicates while consuming uncompressed topics. There will be a small number of duplicates while serving compressed data, but it will be bound by the compression batch size.

      Attachments

        1. kafka-228_v2.patch
          5 kB
          Jun Rao
        2. kafka-228.patch
          23 kB
          Neha Narkhede
        3. kafka-228-v3.patch
          21 kB
          Neha Narkhede

        Issue Links

          Activity

            People

              nehanarkhede Neha Narkhede
              nehanarkhede Neha Narkhede
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: