Uploaded image for project: 'Kafka'
  1. Kafka
  2. KAFKA-17044

Connector deletion can lead to resource leak during a long running connector startup

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Won't Fix
    • None
    • None
    • connect
    • None

    Description

      We have identified a gap in the shutdown flow for the connector worker. If the connector is in INIT state and still executing the WorkerConnector::doStart method, a DELETE API call would invoke the WorkerConnector::shutdown and notify() but the connector worker would not shutdown immediately. This happens because start() is a blocking call and the control reaches wait() in doRun() after the start() call has completed. This results in a gap in the delete flow where the connector is not immediately shutdown leaving the resources running. start() keeps running and only when the execution of start() completes, we reach at the point of wait() and then doShutdown() of the connector worker is invoked.
      This seems similar to what has been identified for connector tasks as part of https://issues.apache.org/jira/browse/KAFKA-14725.
      Steps to repro
      1. Start a connector with time taking operation in connector.start() call
      2. Call DELETE API to delete this connector
      3. The connector would be deleted only after the start() completes.

      The issue was observed when a connector was configured to retry a db connection for sometime. 
      Current Behaviour: The connector did not shutdown until the start() method completed.
      Expected Behaviour: The connector should abort what it is doing and shutdown as requested by the Delete call.

      Attachments

        Activity

          People

            Unassigned Unassigned
            bgoyal Bhagyashree
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: