[CASSANDRA-2988] Improve SSTableReader.load() when loading index files - ASF JIRA

Agile Board

Attach files

Attach Screenshot

Bulk Copy Attachments

Bulk Move Attachments

Voters

Watch issue

Watchers

Create sub-task

Convert to sub-task

Move

Link

Clone

Labels

Update Comment Author

Replace String in Comment

Update Comment Visibility

Delete Comments

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Low
Resolution: Fixed
Fix Version/s: 1.0.0, 1.1.0
Component/s: None
Labels:
None

Description

when we create BufferredRandomAccessFile, we pass skipCache=true. This hurts the read performance because we always process the index files sequentially. Simple fix would be set it to false.
multiple index files of a single column family can be loaded in parallel. This buys a lot when you have multiple super large index files.
we may also change how we buffer. By using BufferredRandomAccessFile, for every read, we need bunch of checking like

do we need to rebuffer?
isEOF()?
assertions
These can be simplified to some extent. We can blindly buffer the index file by chunks and process the buffer until a key lies across boundary of a chunk. Then we rebuffer and start from the beginning of the partially read key. Conceptually, this is same as what BRAF does but w/o the overhead in the read**() methods in BRAF.

Attachments

2988-2-cleaned.txt
02/Nov/11 18:25
7 kB
Jonathan Ellis
2988-2-v2.txt
08/Nov/11 20:31
3 kB
Jonathan Ellis
2988-parallel-v2.txt
20/Sep/11 23:24
8 kB
Jonathan Ellis
c2988-2-v2
02/Nov/11 05:48
7 kB
Michael Wu
c2988-modified-buffer.patch
03/Aug/11 23:52
8 kB
Michael Wu
c2988-parallel-load-sstables.patch
03/Aug/11 23:51
7 kB
Michael Wu

Activity

Comment

This comment will be Viewable by All Users Viewable by All Users

Cancel

People

Assignee:: Michael Wu Assign to me

Reporter:: Michael Wu

Authors:: Michael Wu

Reviewers:: Jonathan Ellis

Votes:: 1 Vote for this issue

Watchers:: 1 Start watching this issue

Dates

Created:: 03/Aug/11 01:32

Updated:: 16/Apr/19 09:32

Resolved:: 23/Dec/11 18:09

Agile

View on Board

Improve SSTableReader.load() when loading index files

Details

Description

Attachments

Attachments

Activity

People

Dates

Agile

Slack

Issue deployment