Uploaded image for project: 'Apache Cassandra'
  1. Apache Cassandra
  2. CASSANDRA-4710

High key hashing overhead for index scans when using RandomPartitioner

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Low
    • Resolution: Fixed
    • 1.2.0 beta 2
    • None
    • None

    Description

      For a workload where the dataset is completely in ram, the md5 hashing of the keys during index scans becomes a bottleneck for reads when using RandomPartitioner, according to profiling.

      Instead performing a raw key equals check in SSTableReader.getPosition() for EQ operations improves throughput by some 30% for my workload (moving the bottleneck elsewhere).

      Attachments

        Activity

          People

            danielnorberg Daniel Norberg
            danielnorberg Daniel Norberg
            Daniel Norberg
            Jonathan Ellis
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: