Uploaded image for project: 'Kafka'
  1. Kafka
  2. KAFKA-15165

Handle Kafka client certificate failures without impacting brokers

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Critical
    • Resolution: Unresolved
    • 2.8.1
    • None
    • core, security
    • None
    • production

    Description

      Following situation is observed in production:

      Consumer or Producer SSL Certificates have expired due to mis-management of extending the certs. When these clients to connect to either read or publish messages, they get authentication failures. These clients keep on retrying and this impacts broker CPUs utilisation, which impacts other healthy clients connected to brokers.

      CPU increase observed from 35% to 85-90%. Clients which are healthy see a spike in publish and consumer latencies upwards to multiply seconds.

      This kind of situation creates a denial of service kind of attack on Kafka cluster.

      We must gracefully handle this, but either:

      1) Not allowing clients to connect or retry or do exponential retries after it fails to authenticate using SSL certs

      2) Broker side changes, where it can blacklist clients for certain duration, which can be overwritten after certs are renewed.

      Attachments

        Activity

          People

            Unassigned Unassigned
            stendu Sandeep
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated: