[HBASE-4465] Lazy-seek optimization for StoreFile scanners - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 0.89.20100924, 0.94.0
Component/s: None
Labels:
- optimization
- seek

Hadoop Flags:

Reviewed
Release Note:
Check the most recent file first before seeking all other files in a Store.

Description

Previously, if we had several StoreFiles for a column family in a region, we would seek in each of them and only then merge the results, even though the row/column we are looking for might only be in the most recent (and the smallest) file. Now we prioritize our reads from those files so that we check the most recent file first. This is done by doing a "lazy seek" which pretends that the next value in the StoreFile is (seekRow, seekColumn, lastTimestampInStoreFile), which is earlier in the KV order than anything that might actually occur in the file. So if we don't find the result in earlier files, that fake KV will bubble up to the top of the KV heap and a real seek will be done. This is expected to significantly reduce the amount of disk IO (as of 09/22/2011 we are doing dark launch testing and measurement).

This is joint work with Liyin Tang – huge thanks to him for many helpful discussions on this and the idea of putting fake KVs with the highest timestamp of the StoreFile in the scanner priority queue.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

HBASE-4465_Lazy-seek_optimization_for_St-20111005121052-b2ea8753.patch
05/Oct/11 20:52
40 kB
Mikhail Gryzykhin

Issue Links

is required by

HBASE-4469 Avoid top row seek by looking up ROWCOL bloomfilter

Closed

HBASE-4532 Avoid top row seek by dedicated bloom filter for delete family bloom filter

Closed

relates to

HBASE-8001 Avoid unnecessary lazy seek

Closed

requires

HBASE-4534 A new unit test for lazy seek and StoreScanner in general

Closed

Activity

People

Assignee:: Mikhail Gryzykhin

Reporter:: Mikhail Gryzykhin

Votes:: 0 Vote for this issue

Watchers:: 11 Start watching this issue

Dates

Created:: 23/Sep/11 03:09

Updated:: 05/Mar/13 08:32

Resolved:: 05/Oct/11 21:32