[HBASE-15560] TinyLFU-based BlockCache - ASF JIRA

Details

Type: Improvement
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: 2.0.0
Fix Version/s: 3.0.0-alpha-1, 2.3.0
Component/s: BlockCache
Labels:
None

Hadoop Flags:

Reviewed
Release Note:

Hide
LruBlockCache uses the Segmented LRU (SLRU) policy to capture frequency and recency of the working set. It achieves concurrency by using an O(n) background thread to prioritize the entries and evict. Accessing an entry is O(1) by a hash table lookup, recording its logical access time, and setting a frequency flag. A write is performed in O(1) time by updating the hash table and triggering an async eviction thread. This provides ideal concurrency and minimizes the latencies by penalizing the thread instead of the caller. However the policy does not age the frequencies and may not be resilient to various workload patterns.

This change introduces a new L1 policy, TinyLfuBlockCache, which records the frequency in a counting sketch, ages periodically by halving the counters, and orders entries by SLRU. An entry is discarded by comparing the frequency of the new arrival to the SLRU's victim, and keeping the one with the highest frequency. This allows the operations to be performed in O(1) time and, though the use of a compact sketch, a much larger history is retained beyond the current working set. In a variety of real world traces the policy had near optimal hit rates.

New configuration variable hfile.block.cache.policy sets the eviction policy for the L1 block cache. The default is "LRU" (LruBlockCache). Set to "TinyLFU" to use TinyLfuBlockCache instead.

Show
LruBlockCache uses the Segmented LRU (SLRU) policy to capture frequency and recency of the working set. It achieves concurrency by using an O(n) background thread to prioritize the entries and evict. Accessing an entry is O(1) by a hash table lookup, recording its logical access time, and setting a frequency flag. A write is performed in O(1) time by updating the hash table and triggering an async eviction thread. This provides ideal concurrency and minimizes the latencies by penalizing the thread instead of the caller. However the policy does not age the frequencies and may not be resilient to various workload patterns. This change introduces a new L1 policy, TinyLfuBlockCache, which records the frequency in a counting sketch, ages periodically by halving the counters, and orders entries by SLRU. An entry is discarded by comparing the frequency of the new arrival to the SLRU's victim, and keeping the one with the highest frequency. This allows the operations to be performed in O(1) time and, though the use of a compact sketch, a much larger history is retained beyond the current working set. In a variety of real world traces the policy had near optimal hit rates. New configuration variable hfile.block.cache.policy sets the eviction policy for the L1 block cache. The default is "LRU" (LruBlockCache). Set to "TinyLFU" to use TinyLfuBlockCache instead.

Description

LruBlockCache uses the Segmented LRU (SLRU) policy to capture frequency and recency of the working set. It achieves concurrency by using an O( n ) background thread to prioritize the entries and evict. Accessing an entry is O(1) by a hash table lookup, recording its logical access time, and setting a frequency flag. A write is performed in O(1) time by updating the hash table and triggering an async eviction thread. This provides ideal concurrency and minimizes the latencies by penalizing the thread instead of the caller. However the policy does not age the frequencies and may not be resilient to various workload patterns.

W-TinyLFU (research paper) records the frequency in a counting sketch, ages periodically by halving the counters, and orders entries by SLRU. An entry is discarded by comparing the frequency of the new arrival (candidate) to the SLRU's victim, and keeping the one with the highest frequency. This allows the operations to be performed in O(1) time and, though the use of a compact sketch, a much larger history is retained beyond the current working set. In a variety of real world traces the policy had near optimal hit rates.

Concurrency is achieved by buffering and replaying the operations, similar to a write-ahead log. A read is recorded into a striped ring buffer and writes to a queue. The operations are applied in batches under a try-lock by an asynchronous thread, thereby track the usage pattern without incurring high latencies (benchmarks).

In YCSB benchmarks the results were inconclusive. For a large cache (99% hit rates) the two caches have near identical throughput and latencies with LruBlockCache narrowly winning. At medium and small caches, TinyLFU had a 1-4% hit rate improvement and therefore lower latencies. The lack luster result is because a synthetic Zipfian distribution is used, which SLRU performs optimally. In a more varied, real-world workload we'd expect to see improvements by being able to make smarter predictions.

The provided patch implements BlockCache using the Caffeine caching library (see HighScalability article).

Edward Bortnikov and Eshcar Hillel have graciously provided guidance for evaluating this patch (github branch).

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

bc.hit.count
04/Nov/16 17:10
21 kB
Michael Stack
bc.miss.count
04/Nov/16 17:10
25 kB
Michael Stack
branch-1.tinylfu.txt
04/Nov/16 20:50
100 kB
Michael Stack
gets
04/Nov/16 17:10
19 kB
Michael Stack
HBASE-15560.patch
03/Apr/19 19:14
55 kB
Andrew Kyle Purtell
HBASE-15560.patch
29/Mar/19 00:02
55 kB
Andrew Kyle Purtell
HBASE-15560.patch
26/Mar/19 18:31
54 kB
Andrew Kyle Purtell
HBASE-15560.patch
26/Mar/19 17:32
54 kB
Andrew Kyle Purtell
HBASE-15560.patch
25/Mar/19 23:38
54 kB
Andrew Kyle Purtell
HBASE-15560.patch
04/Oct/16 01:11
57 kB
Ben Manes
HBASE-15560.patch
03/Oct/16 00:43
66 kB
Ben Manes
HBASE-15560.patch
28/Sep/16 03:18
66 kB
Ben Manes
HBASE-15560.patch
27/Sep/16 18:43
66 kB
Ben Manes
HBASE-15560.patch
27/Sep/16 05:43
65 kB
Ben Manes
HBASE-15560.patch
26/Sep/16 22:48
64 kB
Ben Manes
HBASE-15560.patch
12/Sep/16 16:38
33 kB
Ben Manes
run_ycsb_c.sh
08/Nov/16 18:44
6 kB
Michael Stack
run_ycsb_loading.sh
08/Nov/16 18:44
3 kB
Michael Stack
tinylfu.patch
29/Mar/16 18:58
33 kB
Ben Manes

Issue Links

is blocked by

HBASE-15624 Move master branch/hbase-2.0.0 to jdk-8 only

Closed

Sub-Tasks

1.

New 2.0 blockcache (tinylfu) doesn't have inmemory partition, etc Update doc and codebase accordingly

Patch Available

Biju Nair

TinyLFU-based BlockCache

Details

Description

Attachments

Attachments

Issue Links

Sub-Tasks

Activity

People

Dates