Details
-
Improvement
-
Status: Resolved
-
Low
-
Resolution: Duplicate
-
None
Description
When performing a read local to a coordinator node, AbstractReadExecutor will create a new SP.LocalReadRunnable and drop it into the read stage for asynchronous execution. If you are using a client that intelligently routes read requests to a node holding the data for a given request, and are using CL.ONE/LOCAL_ONE, the enqueuing SP.LocalReadRunnable and waiting for the context switches (and possible NUMA misses) adds unneccesary latency. We can reduce that latency and improve throughput by avoiding the queueing and thread context switching by simply executing the SP.LocalReadRunnable synchronously in the request thread. Testing on a three node cluster (each with 32 cpus, 132 GB ram) yields ~10% improvement in throughput and ~20% speedup on avg/95/99 percentiles (99.9% was about 5-10% improvement).
Attachments
Attachments
Issue Links
- duplicates
-
CASSANDRA-4718 More-efficient ExecutorService for improved throughput
- Resolved
- is superceded by
-
CASSANDRA-4718 More-efficient ExecutorService for improved throughput
- Resolved