Kafka / KAFKA-1436

Idempotent Producer / Duplicate Detection

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Duplicate
    • Affects Version/s: 0.8.0, 0.8.1, 0.8.1.1, 0.8.2.0, 0.10.1.0
    • Fix Version/s: None
    • Component/s: consumer, producer
    • Labels: None

    Description

      Dealing with duplicate messages is one of the major issues for teams using Kafka, and Jay Kreps posted a page on implementing an Idempotent Producer to address this issue:

      https://cwiki.apache.org/confluence/display/KAFKA/Idempotent+Producer

      MapDB 1.0 (https://github.com/jankotek/MapDB) was just released, and either it or Java Chronicle (https://github.com/OpenHFT/Java-Chronicle/) could be embedded within each broker to provide a high-performance, random-access, off-heap store for request IDs.
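      To make the idea concrete, here is a minimal sketch of broker-side duplicate detection keyed by request ID. It is not the design from the wiki page and not Kafka code: the class name `RequestIdDeduplicator`, the method `firstTimeSeen`, and the fixed capacity are all hypothetical, and a bounded on-heap `LinkedHashMap` stands in for the off-heap store (MapDB or Java Chronicle) that the proposal has in mind.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical duplicate detector; an on-heap stand-in for an off-heap request-ID store. */
public class RequestIdDeduplicator {
    private final Map<Long, Boolean> seen;

    public RequestIdDeduplicator(final int capacity) {
        // Insertion-ordered map that drops the oldest request ID once capacity is exceeded.
        this.seen = new LinkedHashMap<Long, Boolean>(16, 0.75f, false) {
            @Override
            protected boolean removeEldestEntry(Map.Entry<Long, Boolean> eldest) {
                return size() > capacity;
            }
        };
    }

    /** Returns true if the request ID is new (append the message), false if it is a duplicate. */
    public synchronized boolean firstTimeSeen(long requestId) {
        return seen.put(requestId, Boolean.TRUE) == null;
    }
}
```

      A broker would call `firstTimeSeen(requestId)` before appending and skip the append when it returns false. Because the window is bounded, a retry that arrives after its ID has been evicted would be accepted again, so the capacity has to cover the longest expected retry interval.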

      As Jay points out in his post, globally unique request IDs probably aren't needed, but if that need should arise, Twitter's Snowflake service (https://github.com/twitter/snowflake/) might be useful.
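      For reference, a Snowflake-style ID packs a millisecond timestamp, a worker identifier, and a per-millisecond sequence number into one 64-bit value. The sketch below illustrates that layout (41 timestamp bits, 10 worker bits, 12 sequence bits); it is not Twitter's implementation, and the class name and the custom epoch value are arbitrary assumptions chosen for the example.

```java
/** Hypothetical Snowflake-style generator: timestamp | worker ID | sequence. */
public class SnowflakeStyleIdGenerator {
    private static final long EPOCH_MS = 1400000000000L; // arbitrary custom epoch (assumption)
    private static final int WORKER_BITS = 10;
    private static final int SEQUENCE_BITS = 12;
    private static final long MAX_SEQUENCE = (1L << SEQUENCE_BITS) - 1;

    private final long workerId;
    private long lastTimestamp = -1L;
    private long sequence = 0L;

    public SnowflakeStyleIdGenerator(long workerId) {
        if (workerId < 0 || workerId >= (1L << WORKER_BITS)) {
            throw new IllegalArgumentException("workerId out of range: " + workerId);
        }
        this.workerId = workerId;
    }

    public synchronized long nextId() {
        long now = System.currentTimeMillis();
        if (now < lastTimestamp) {
            throw new IllegalStateException("Clock moved backwards; refusing to generate IDs");
        }
        if (now == lastTimestamp) {
            // Same millisecond: bump the sequence, and spin to the next millisecond on overflow.
            sequence = (sequence + 1) & MAX_SEQUENCE;
            if (sequence == 0) {
                while ((now = System.currentTimeMillis()) <= lastTimestamp) {
                    // busy-wait until the clock advances
                }
            }
        } else {
            sequence = 0L;
        }
        lastTimestamp = now;
        return ((now - EPOCH_MS) << (WORKER_BITS + SEQUENCE_BITS))
                | (workerId << SEQUENCE_BITS)
                | sequence;
    }
}
```

      IDs from a single generator are strictly increasing, and IDs from different workers never collide as long as each worker has a distinct `workerId`, so no coordination is needed on the request path.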

            People

              Assignee: Neha Narkhede
              Reporter: James Thornton
              Votes: 7
              Watchers: 11