[CASSANDRA-3545] Fix very low Secondary Index performance - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Normal
Resolution: Fixed
Fix Version/s: 1.1.0
Component/s: Feature/2i Index
Labels:
None

Description

While performing index search + value filtering over large Index Row ( ~100k keys per index value) with chunks (size of 512-1024 keys) search time is about 8-12 seconds, which is very very low.

After profiling I got this picture:

60% of search time is calculating MD5 hash with MessageDigester (Of cause it is because of RundomPartitioner).
33% of search time (half of all MD5 hash calculating time) is double calculating of MD5 for comparing two row keys while rotating Index row to startKey (when performing search query for next chunk).

I see several performance improvements:

1) Use good algorithm to search startKey in sorted collection, that is faster then iteration over all keys. This solution is on first place because it simple, need only local code changes and should solve problem (increase search in multiple times).

2) Don't calculate MD5 hash for startKey every time. It's optimal to compute it once (so search will be twice faster).
Also need local code changes.

3) Think about something faster that MD5 for hashing (like TigerRandomPartitioner with Tiger/128 hash).
Need research and maybe this research was done.

4) Don't use Tokens (with MD5 hash for RandomPartitioner) for comparing and sorting keys in index rows. In index rows, keys can be stored and compared with simple Byte Comparator.
This solution requires huge code changes.

I'm going to start from first solution. Next improvements can be done with next tickets.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

0001-3545.patch
08/Dec/11 15:57
10 kB
Sylvain Lebresne
0002-cleanup.patch
08/Dec/11 15:57
14 kB
Sylvain Lebresne

Activity

People

Assignee:: Sylvain Lebresne

Reporter:: Evgeny Ryabitskiy

Authors:: Sylvain Lebresne

Reviewers:: Jonathan Ellis

Votes:: 0 Vote for this issue

Watchers:: 1 Start watching this issue

Dates

Created:: 30/Nov/11 22:40

Updated:: 16/Apr/19 09:32

Resolved:: 08/Dec/11 17:31