Uploaded image for project: 'Samza'
  1. Samza
  2. SAMZA-111

SystemConsumers is slow with large partition count

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 0.6.0
    • 0.7.0
    • container
    • None

    Description

      We have been seeing very slow processing speed when running a Samza container that consumes from 1000s of partitions. We don't see a corresponding slow speed when running the same code, but with fewer input partitions (say 8-24).

      The messages per second seems to drop off as more partitions are added to the Samza container. One Samza job has ~2500 partitions, and is seeing only 6000 messages/sec. The same code running with ~9 partitions is seeing 30,000 messages/sec.

      Attachments

        1. SAMZA-111.1.patch
          32 kB
          Chris Riccomini
        2. SAMZA-111.0.png
          172 kB
          Chris Riccomini
        3. SAMZA-111.0.patch
          30 kB
          Chris Riccomini
        4. samza-perf-hacks.0.diff
          33 kB
          Chris Riccomini
        5. samza-perf-hacks.png
          185 kB
          Chris Riccomini
        6. 12-threads-1000-streams-4-partitions-each-with-hacky-fix.png
          100 kB
          Chris Riccomini
        7. 12-threads-1000-streams-4-partitions-each.png
          105 kB
          Chris Riccomini
        8. 12-threads-8-streams-4-partitions-each.png
          109 kB
          Chris Riccomini

        Activity

          People

            criccomini Chris Riccomini
            criccomini Chris Riccomini
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: