Details
-
New Feature
-
Status: Resolved
-
Normal
-
Resolution: Duplicate
-
None
-
None
-
None
Description
Existing data filtering uses the following algorithm
1. find best selective predicate based on the smallest mean columns count
2. fetch rows for the best selective predicate predicate, then filter the data based on other predicates left.
So potentially we could improve the performance by
1. joining multiple predicates then do the data filtering for other predicates. 2. fine tune the best predicate selection algorithm
For multiple predicate join, it could improve performance if one predicate has many entries and another predicate has a very few of entries. It means a few index CF read, join the row keys, fetch rows then filter other predicates
Another approach is to have index on multiple columns.
Attachments
Attachments
Issue Links
- is superceded by
-
CASSANDRA-10765 add RangeIterator interface and QueryPlan for SI
- Open