Uploaded image for project: 'Kafka'
  1. Kafka
  2. KAFKA-50

kafka intra-cluster replication support

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 0.8.0
    • None
    • None

    Description

      Currently, Kafka doesn't have replication. Each log segment is stored in a single broker. This limits both the availability and the durability of Kafka. If a broker goes down, all log segments stored on that broker become unavailable to consumers. If a broker dies permanently (e.g., disk failure), all unconsumed data on that node is lost forever. Our goal is to replicate every log segment to multiple broker nodes to improve both the availability and the durability.

      We'd like to support the following in Kafka replication:

      1. Configurable synchronous and asynchronous replication
      2. Small unavailable window (e.g., less than 5 seconds) during broker failures
      3. Auto recovery when a failed broker rejoins
      4. Balanced load when a broker fails (i.e., the load on the failed broker is evenly spread among multiple surviving brokers)

      Here is a complete design proposal for Kafka replication - https://cwiki.apache.org/confluence/display/KAFKA/Kafka+Replication

      Attachments

        Issue Links

          Activity

            People

              junrao Jun Rao
              junrao Jun Rao
              Votes:
              3 Vote for this issue
              Watchers:
              23 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - 1,764h
                  1,764h
                  Remaining:
                  Remaining Estimate - 1,764h
                  1,764h
                  Logged:
                  Time Spent - Not Specified
                  Not Specified