[CASSANDRA-1046] optimize Memtable.getSliceIterator - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Normal
Resolution: Fixed
Fix Version/s: 0.7 beta 1
Component/s: None
Labels:
None

Description

As reported by James Golick, about 30% of the time in a read is spent in SliceQueryFilter.getMemColumnIterator, virtually all of which is in ConcurrentSkipListMap$Values.toArrray().

I wrote on the ML:

Besides the UUID optimization you posted, we should do an audit of ColumnFamily.getSortedColumns and replace with iteration where possible (in this case, we'd be left with one copy of most of the columns, but that's better than two).

We can get rid of the other copy by fixing the logic in Memtable.getSliceIterator, which says "copy all the columns, so we can do a binary search on them to find where to start," but since columns are natively in sorted order we could just use an iterator and a while loo

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

0001-trunk-cassandra-1046.patch
04/Jun/10 19:58
5 kB
Matthew F. Dennis
insertarator.py
04/Jun/10 19:58
2 kB
Matthew F. Dennis
readarator.py
04/Jun/10 19:58
2 kB
Matthew F. Dennis

Activity

People

Assignee:: Matthew F. Dennis

Reporter:: Jonathan Ellis

Authors:: Matthew F. Dennis

Votes:: 0 Vote for this issue

Watchers:: 5 Start watching this issue

Dates

Created:: 03/May/10 15:37

Updated:: 16/Apr/19 09:33

Resolved:: 04/Jun/10 22:08