Uploaded image for project: 'Apache Apex Malhar'
  1. Apache Apex Malhar
  2. APEXMALHAR-2158

Duplication of data emitted when the Kafka Input Operator(0.8 version) redeploys

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 3.5.0
    • None
    • None

    Description

      Kafka Input Operator failure recovery depends on offsetStats map.
      "offsetStats" map is updated whenever the message is emitted from operator.

      When the operator goes down and comes back, operator sets this map to the consumer and start reading from these offsets. Consumer reads the last emitted message again and operator emits it.

      Solution: Increment the offsets before setting to the consumer.

      Attachments

        Issue Links

          Activity

            People

              chaithu Chaitanya Chebolu
              chaithu Chaitanya Chebolu
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: