Uploaded image for project: 'Apache Cassandra'
  1. Apache Cassandra
  2. CASSANDRA-16078

Performance regression for queries accessing multiple rows

    XMLWordPrintableJSON

Details

    • Degradation - Performance Bug/Regression
    • Critical
    • Normal
    • Performance Regression Test
    • All
    • None

    Description

      This is spin off from CASSANDRA-16036.

      In testing 4.0 relative to 3.0* I found that queries which accessed multiple rows to have a noticeable performance decrease; two queries were used in the test (more may be impacted, others might not): query partition (table has clustering keys) with LIMIT, and query clustering keys using IN clause.

      In the below graphs the green line is 3.0 and the other lines are 4.0 (with and without chunk cache)

      Partition with LIMIT


      Cluster with IN clause


      Attachments

        1. async-profile-3.0.19-3.svg
          567 kB
          Caleb Rackliffe
        2. async-profile-4.0.0-3.svg
          521 kB
          Caleb Rackliffe
        3. latency_selects_3_4.png
          78 kB
          Caleb Rackliffe
        4. ClusteringSlicing.kt
          3 kB
          Brandon Williams
        5. image.png
          99 kB
          Ekaterina Dimitrova

        Issue Links

          Activity

            People

              maedhroz Caleb Rackliffe
              dcapwell David Capwell
              Caleb Rackliffe
              Votes:
              0 Vote for this issue
              Watchers:
              8 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: