[HBASE-13389] [REGRESSION] HBASE-12600 undoes skip-mvcc parse optimizations - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Sub-task
Status: Closed
Priority: Major
Resolution: Incomplete
Affects Version/s: None
Fix Version/s: None
Component/s: Performance
Labels:
None

Description

~~HBASE-12600~~ moved the edit sequenceid from tags to instead exploit the mvcc/sequenceid slot in a key. Now Cells near-always have an associated mvcc/sequenceid where previous it was rare or the mvcc was kept up at the file level. This is sort of how it should be many of us would argue but as a side-effect of this change, read-time optimizations that helped speed scans were undone by this change.

In this issue, lets see if we can get the optimizations back – or just remove the optimizations altogether.

The parse of mvcc/sequenceid is expensive. It was noticed over in ~~HBASE-13291~~.

The optimizations undone by this changes are (to quote the optimizer himself, Mr lhofhansl):

Looks like this undoes all of ~~HBASE-9751~~, ~~HBASE-8151~~, and ~~HBASE-8166~~.
We're always storing the mvcc readpoints, and we never compare them against the actual smallestReadpoint, and hence we're always performing all the checks, tests, and comparisons that these jiras removed in addition to actually storing the data - which with up to 8 bytes per Cell is not trivial.

This is the 'breaking' change: https://github.com/apache/hbase/commit/2c280e62530777ee43e6148fd6fcf6dac62881c0#diff-07c7ac0a9179cedff02112489a20157fR96

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

13389.txt
04/Apr/15 18:47
2 kB
Lars Hofhansl

Activity

People

Assignee:: Unassigned

Reporter:: Michael Stack

Votes:: 0 Vote for this issue

Watchers:: 13 Start watching this issue

Dates

Created:: 02/Apr/15 21:48

Updated:: 23/Jun/22 20:46

Resolved:: 23/Jun/22 20:46