Uploaded image for project: 'Kafka'
  1. Kafka
  2. KAFKA-10888

Sticky partition leads to uneven product msg, resulting in abnormal delays in some partitions

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 2.4.1
    • 3.3.0
    • clients, producer
    • None

    Description

        110 producers ,550 partitions ,550 consumers , 5 nodes Kafka cluster
        The producer uses the nullkey+stick partitioner, the total production rate is about 100w tps
      Observed partition delay is abnormal and message distribution is uneven, which leads to the maximum production and consumption delay of the partition with more messages
      abnormal.

        I cannot find reason that stick will make the message distribution uneven at this production rate.
        I can't switch to the round-robin partitioner, which will increase the delay and cpu cost. Is thathe stick partationer design cause uneven message distribution, or this is abnormal. How to solve it?

       

      As shown in the picture, the uneven distribution is concentrated on some partitions and some brokers, there seems to be some rules.

      This problem does not only occur in one cluster, but in many high tps clusters,

      The problem is more obvious on the test cluster we built.

      Attachments

        Issue Links

          Activity

            People

              alivshits Artem Livshits
              jr981008 jr
              Votes:
              0 Vote for this issue
              Watchers:
              13 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: