Uploaded image for project: 'Apache Apex Malhar'
  1. Apache Apex Malhar
  2. APEXMALHAR-2086

Kafka Output Operator with Kafka 0.9 API

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 3.5.0
    • None
    • None

    Description

      Goal : 2 Operartors for Kafka Output

      1. Simple Kafka Output Operator

      • Supports Atleast Once
      • Expose most used producer properties as class properties

      2. Exactly Once Kafka Output ( Not possible in all the cases, will be documented later )

      Design for Exactly Once

      Window Data Manager - Stores the Kafka partitions offsets.
      Kafka Key - Used by the operator = AppID#OperatorId

      During recovery. Partially written window is re-created using the following approach:

      Tuples between the largest recovery offsets and the current offset are checked. Based on the key, tuples written by the other entities are discarded.

      Only tuples which are not in the recovered set are emitted.

      Tuples needs to be unique within the window.

      Attachments

        Issue Links

          Activity

            People

              sandesh Sandesh Hegde
              sandesh Sandesh Hegde
              Votes:
              0 Vote for this issue
              Watchers:
              6 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: