[CASSANDRA-18515] Optimize Initial Concurrency Selection for Range Read Algorithm During SAI Queries - ASF JIRA

Agile Board

Attach files

Attach Screenshot

Bulk Copy Attachments

Bulk Move Attachments

Voters

Watch issue

Watchers

Create sub-task

Convert to sub-task

Move

Link

Clone

Labels

Update Comment Author

Replace String in Comment

Update Comment Visibility

Delete Comments

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Normal
Resolution: Fixed
Fix Version/s: 5.x
Component/s: Feature/2i Index
Labels:
None

Epic Link:
CEP-7 Storage Attached Index (SAI)
Change Category:
Performance
Complexity:
Normal
Platform:

All
Impacts:

None
Source Control Link:

https://github.com/apache/cassandra/commit/e5337eb911603126496a30c5a0b05f1863d7f817
Test and Documentation Plan:

Hide

The latest circle-ci test run is here: https://app.circleci.com/pipelines/github/mike-tr-adamson/cassandra/184/workflows/8ab95667-21d5-48b3-8b91-fb25a6914f06

Show
The latest circle-ci test run is here: https://app.circleci.com/pipelines/github/mike-tr-adamson/cassandra/184/workflows/8ab95667-21d5-48b3-8b91-fb25a6914f06

Description

The range read algorithm relies on the Index API’s notion of estimated result rows to decide how many replicas to contact in parallel during its first round of requests. The more results expected from a replica for a token range, the fewer replicas the range read will initially try to contact. Like SASI, SAI floors that estimate to a huge negative number to make sure it’s selected over other indexes, and this floors the concurrency factor to 1. The actual formula looks like this:

// resultsPerRange, from SAI, is a giant negative number
concurrencyFactor = Math.max(1, Math.min(ranges.rangeCount(), (int) Math.ceil(command.limits().count() / resultsPerRange)));

Although that concurrency factor is updated as actual results stream in, only sending a single range request to a single replica in every case for SAI is not ideal. For example, assume I have a 3 node cluster and a keyspace at RF=1, with 10 rows spread across the 3 nodes, without vnodes. Issuing a query that matches all 10 rows with a LIMIT of 10 will make 2 or 3 serial range requests from the coordinator, one to each of the 3 nodes.

This can be fixed by allowing indexes to bypass the initial concurrency calculation allowing SAI queries to contact the entire ring in a single round of queries, or at worst the minimum number of rounds as bounded by the existing statutory maximum ranges per round.

Attachments

Issue Links

Add Link

links to

GitHub Pull Request #2463

Delete this link

Activity

Comment

This comment will be Viewable by All Users Viewable by All Users

Cancel

People

Assignee:: Mike Adamson Assign to me

Reporter:: Mike Adamson

Authors:: Mike Adamson

Reviewers:: Andres de la Peña, Berenguer Blasi, Caleb Rackliffe

Votes:: 0 Vote for this issue

Watchers:: 4 Start watching this issue

Dates

Created:: 10/May/23 09:26

Updated:: 11/Jul/23 19:48

Resolved:: 11/Jul/23 19:48

Time Tracking

Estimated:

Not Specified

Remaining:

Logged:

Optimize Initial Concurrency Selection for Range Read Algorithm During SAI Queries

Details

Description

Attachments

Attachments

Issue Links

Activity

People

Dates

Time Tracking

Agile

Slack

Issue deployment