CASSANDRA-1555

Considerations for larger bloom filters

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Fix Version/s: 0.7.1
    • Component/s: Core
    • Labels:
      None

      Description

      To (optimally) support SSTables larger than 143 million keys, we need to support bloom filters larger than 2^31 bits, which java.util.BitSet can't handle directly.

      A few options:

      • Switch to a BitSet class which supports 2^31 * 64 bits (Lucene's OpenBitSet)
      • Partition the java.util.BitSet behind our current BloomFilter
        • Straightforward bit partitioning: bit N is in bitset N // 2^31
        • Separate equally sized complete bloom filters for member ranges, which can be used independently or OR'd together under memory pressure.

      All of these options require new approaches to serialization.
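      (Editor's note: as a rough sketch of the second option, not the eventual patch, bit N can be routed to java.util.BitSet number N // 2^31. Class and field names below are hypothetical.)

      import java.util.BitSet;

      /** Hypothetical partitioned bitset: bit N lives in partition N / 2^31, at offset N % 2^31. */
      public class PartitionedBitSet
      {
          private static final long PARTITION_SIZE = 1L << 31;
          private final BitSet[] partitions;

          public PartitionedBitSet(long numBits)
          {
              int count = (int) ((numBits + PARTITION_SIZE - 1) / PARTITION_SIZE);
              partitions = new BitSet[count];
              for (int i = 0; i < count; i++)
                  partitions[i] = new BitSet();
          }

          public void set(long bitIndex)
          {
              partitions[(int) (bitIndex / PARTITION_SIZE)].set((int) (bitIndex % PARTITION_SIZE));
          }

          public boolean get(long bitIndex)
          {
              return partitions[(int) (bitIndex / PARTITION_SIZE)].get((int) (bitIndex % PARTITION_SIZE));
          }
      }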

  Attachments

  1. 1555_v5.txt
        120 kB
        T Jake Luciani
      2. 1555_v6.txt
        119 kB
        T Jake Luciani
      3. 1555-v7.txt
        119 kB
        Jonathan Ellis
      4. cassandra-1555.tgz
        12 kB
        Ryan King
      5. CASSANDRA-1555v2.patch
        112 kB
        Ryan King
      6. CASSANDRA-1555v3.patch.gz
        20 kB
        Ryan King
      7. CASSANDRA-1555v4.patch.gz
        20 kB
        Ryan King

          Activity

          Stu Hood added a comment -

          Also interesting: http://code.google.com/p/javaewah/

          Jonathan Ellis added a comment -

          EWAH looks interesting. Would be interesting to see what that does for our BF.

          (More generally I suspect that degrading BF FP rate for extremely skinny rows can actually be the right thing to do.)

          Stu Hood added a comment -

          > EWAH looks interesting. Would be interesting to see what that does for our BF.
          I looked at it a bit more: it can't set() bits in random order (must ascend) and doesn't appear to support more than 2^31 bits. So we'd need another solution for filter creation, although we could compress into an EWAH at load time.

          > More generally I suspect that degrading BF FP rate for extremely skinny rows can actually be the right thing to do.
          rcoli posted a use case for ~240 million rows in ~90GB of data: I wouldn't call ~400 bytes extremely skinny, but 90GB is way too early to be degrading performance.

          Sylvain Lebresne added a comment -

          Even if we should probably have a way to support big BFs, it would still be nice to be able to
          configure the BF FP rate. 2+ GB of BFs can start to be a heavy weight on your memory.
          For some loads, it could be better to keep that memory for the row cache, for instance.

          Jonathan Ellis added a comment -

          > 90GB is way too early to be degrading performance

          Well, that depends. If we're "degrading" from 6.71e-05 to 0.00108 FP to save memory, then yes, that's definitely the right thing to do, or even a 16GB heap will get exhausted pretty quickly. But if we're having to degrade to > 1% FP then that would be something to address.
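          (Editor's note: false positive figures like these can be sanity-checked with the standard bloom filter approximation p ≈ (1 - e^(-k/c))^k, where c is bits per element and k the number of hash functions. A minimal, self-contained check, not part of any patch here:)

          /** Editor's sketch: approximate bloom filter false positive rate and memory cost. */
          public class BloomFpCheck
          {
              static double falsePositiveRate(double bitsPerElement, int hashCount)
              {
                  // p ~= (1 - e^(-k/c))^k, with c = m/n bits per element and k hash functions
                  return Math.pow(1 - Math.exp(-hashCount / bitsPerElement), hashCount);
              }

              public static void main(String[] args)
              {
                  System.out.println(falsePositiveRate(20, 14)); // ~6.7e-05
                  System.out.println(falsePositiveRate(10, 7));  // ~8.2e-03
                  // rough memory cost: one billion keys at 15 bits per key is ~1.8 GB of filter
                  System.out.println((1000000000L * 15 / 8 / (1 << 20)) + " MB for 1e9 keys at 15 bits per key");
              }
          }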

          Stu Hood added a comment -

          > If we're "degrading" from 6.71e-05 to 0.00108 FP to save memory, then yes, that's definitely the right thing to do
          They were actually at 0.0229 FP, but point taken.

          > it would still be nice to be able to configure the BF FP rate
          Perhaps. But, that doesn't excuse us from finding reasonable defaults.

          Ryan King added a comment - edited

          We have a patch to deal with having larger bitsets. I'm in the process of making it into a patchset but wanted to make some comments and get feedback first.

          It's not based on the Lucene bitset or EWAH, for a couple of reasons (which might not be totally legit):

          EWAH looks like it might be useful for reducing on-disk storage for BFs, but it doesn't appear to be useful for in-memory usage because the cost of checking a bit is linear, not constant. That seems like a serious performance degradation that's not even worth considering. And reducing the storage footprint for BFs seems like the least of our concerns at this point (not that we should never do it).

          Lucene's OpenBitSet is implemented as an array of longs, looks genuinely useful, and would allow us to have BFs of up to 64*2**32-1 bits (274,877,906,943). Our implementation http://github.com/kakugawa/cassandra/compare/fbb7c26acde572abf625...cassandra-0.6-counts-bf#diff-5 uses an array of BitSets and can therefore handle up to 2**30*2**31 bits (2.30584301 x 10**18). That's quite a bit more bits, but I'm not sure if it's worth it if the Lucene approach can cover us.

          UPDATE: note that the Lucene impl is 17 GB worth of data. I doubt we'd go over that anytime soon.
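          (Editor's note: the constant-time property being contrasted with EWAH above comes from flat word addressing. A stripped-down sketch of the array-of-longs approach, not Lucene's actual OpenBitSet code:)

          /** Editor's sketch of an array-of-longs bitset: word index = bit >>> 6, bit-in-word = bit & 63. */
          public class LongArrayBitSet
          {
              private final long[] words;

              public LongArrayBitSet(long numBits)
              {
                  words = new long[(int) ((numBits + 63) >>> 6)];
              }

              public void set(long bitIndex)
              {
                  words[(int) (bitIndex >>> 6)] |= 1L << (bitIndex & 63);
              }

              public boolean get(long bitIndex)
              {
                  // one array access plus a mask: O(1), regardless of how many bits are set
                  return (words[(int) (bitIndex >>> 6)] & (1L << (bitIndex & 63))) != 0;
              }
          }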

          Stu Hood added a comment -

          Well that's embarrassing.

          Ryan King added a comment -

          0001-openbitset.patch

          Port of Lucene's OpenBitSet (taken from patch on CASSANDRA-1472)

          0002-switch-bf-to-obs.patch

          Switch to use OBS in the BF. It should still be able to read BitSets from disk, but that needs more testing.

          T Jake Luciani added a comment -

          I'm not confident the murmur hash changes will produce unique enough numbers, given you are xor'ing 32 bits into 64 bits.

          I think we should update this to the 64 bit version: http://d3s.mff.cuni.cz/~holub/sw/javamurmurhash/MurmurHash.java

          Problem is then we break the BitSet backwards compatibility...

          Ryan King added a comment -

          Yeah, you're right, we need to upgrade to the 64-bit version.

          Our options with backward-compat seem to be:

          1) keep 2 separate BF impls, each with their own bitset and hash function

          2) regenerate the BF the first time we start up with this patch

          I'd lean towards #2, but am open to other suggestions.

          T Jake Luciani added a comment -

          I'm leaning towards 1): keep both, and upgrade to the newer impl by performing a major compaction.

          The problem with 2) is that for users with TBs of SSTables, regenerating on startup could take days...

          Ryan King added a comment -

          I think that's fair. I'll rework the patch.

          Ryan King added a comment -

          Here's a patch that takes a better approach:

          It uses the SSTable version to tell which type of bloom filter to use. In order to make this work I had to do some refactoring in the Iterators. There were a number of places where we were passing around CFMetaData objects where an SSTableReader would be better, because it would allow us to get at the Descriptor for that table. AFAICT all the call points had an SSTableReader available, so this refactoring was not very intrusive.

          The new thing is the BigBloomFilter, which uses OpenBitSet and LongMurmurHash. Some of it is copy/paste from BloomFilter.

          All unit and system tests pass, but this could use some more testing, for sure, especially around the upgrade path.

          Also, the LongMurmurHash seems to have more collisions. I'll see if I can figure out why.

          One other note: FilterTest became FilterTestHelper because it no longer has any test methods of its own.
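          (Editor's note: the version-based selection described above boils down to something like the following fragment; the field and method names are illustrative, not quoted from the patch, and it assumes the surrounding Cassandra classes.)

          // Editor's sketch; desc.usesBigBloomFilter is a hypothetical flag derived from the SSTable version.
          public static Filter deserializeFilter(DataInput in, Descriptor desc) throws IOException
          {
              // older SSTable versions carry the java.util.BitSet-backed BloomFilter (32-bit hash);
              // newer versions carry the OpenBitSet-backed BigBloomFilter (64-bit hash)
              return desc.usesBigBloomFilter
                   ? BigBloomFilter.serializer().deserialize(in)
                   : BloomFilter.serializer().deserialize(in);
          }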

          Stu Hood added a comment - edited
          • If BloomFilter is likely to be deprecated in favor of BigBloomFilter, naming them LegacyBloomFilter and BloomFilter might reduce the surface area of future changes
          • Probably a good opportunity to improve the serialization of BigBloomFilter: Java serialization is very wasteful for space (each row would contain the string "org.apache.cassandra.utils.obs.OpenBitSet"). Instead, just serializing an OpenBitSet as a long[] and # of valid bits would be much better
          • (Big)BloomFilter
            • maxBucketsPerElement can be pushed up into Filter
            • getFilter could probably be pushed up to Filter, or at least removed from BloomFilter
            • emptyBuckets is unused
          • Orphaned method BigBloomFilter.serializeBitSet
          • Indentation is off in SSTableReader and BigBloomFilter

          I'm working on a separate issue to refresh LegacySSTableTest to check the column-level bloom filters as well: see CASSANDRA-1822
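          (Editor's note: a minimal sketch of the long[]-plus-valid-bit-count serialization suggested in the second bullet; the actual serializer in the patch may differ.)

          import java.io.DataInput;
          import java.io.DataOutput;
          import java.io.IOException;

          /** Editor's sketch: write the backing words and valid-bit count directly instead of using Java serialization. */
          public class WordBitSetSerializer
          {
              public static void serialize(long[] words, long numBits, DataOutput out) throws IOException
              {
                  out.writeLong(numBits);
                  out.writeInt(words.length);
                  for (long word : words)
                      out.writeLong(word);
              }

              public static long[] deserialize(DataInput in) throws IOException
              {
                  long numBits = in.readLong(); // number of valid bits; a real filter would use this to size the bitset
                  long[] words = new long[in.readInt()];
                  for (int i = 0; i < words.length; i++)
                      words[i] = in.readLong();
                  return words;
              }
          }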

          Ryan King added a comment -

          New patch with several changes based on Stu's feedback:

          • renamed BloomFilter to LegacyBloomFilter and BigBloomFilter to BloomFilter
          • moved maxBucketsPerElement to BloomCalculations
          • removed emptybuckets
          • cleaned up formatting in SSTableReader and BigBloomFilter

          Finally, I changed the serialization to read and write the long[] directly, which saves a lot of space for small filters (the column filter for a 10-item row goes from 120 bytes to 16).

          Stu Hood added a comment -

          Almost there... I'm attaching an addendum that I needed to get the long-running unit tests building. Once I got them running, they were reporting an exception: run `ant clean long-test` to reproduce.

          As is, this patch passes the row-level compatibility test on CASSANDRA-1822, so as soon as we figure out the false positive problem, I can give it a thumbs up.

          Ryan King added a comment -

          Another round to fix the long tests.

          And on the FP rate, it seems that it's actually in the expected range based on the table here: http://pages.cs.wisc.edu/~cao/papers/summary-cache/node8.html, though we should probably double-check that math.

          T Jake Luciani added a comment -

          Fixed the murmur hash problem (the issue was with the use of ByteBuffers).
          Refactored the code a bit; put hash32 and hash64 into the MurmurHash class.

          Overall I'm happy with this implementation, especially the sstable descriptor approach.

          +1

          Stu, I wasn't able to apply your latest patch for tests; could you rebase against v5?

          Ryan King added a comment -

          Stu's last patch is incorporated (in spirit, I took a slightly different approach) in my latest.

          Jonathan Ellis added a comment -

          Can you rebase v5, Jake?

          T Jake Luciani added a comment -

          Rebased and fixed the higher collision rate in hash64(). The code wasn't ensuring k remained unsigned. Now the test passes with < 100 collisions.
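          (Editor's note: in Java, the usual way a 64-bit hash value ends up "not unsigned" is sign extension when bytes are widened to long. A minimal illustration of the masking that avoids it, not the actual MurmurHash fix:)

          /** Editor's sketch: assemble a little-endian long from bytes without sign extension. */
          public final class UnsignedBytes
          {
              static long readLongLE(byte[] data, int offset)
              {
                  long k = 0;
                  for (int i = 7; i >= 0; i--)
                      // & 0xffL keeps a negative byte from sign-extending into the upper bits of k
                      k = (k << 8) | (data[offset + i] & 0xffL);
                  return k;
              }
          }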

          T Jake Luciani added a comment -

          re-rebased

          Jonathan Ellis added a comment -

          v7 attached w/ import and code style cleanup.

          The CFMetadata -> SSTableReader change in the Filter classes looks gratuitous – I'm guessing this is b/c the patch started against an older version of the code, when that is what the signature was. Let's undo that unless there is a deeper significance I am missing.

          T Jake Luciani added a comment -

          The CFMetadata -> SSTableReader change is required because the Descriptor is only available from the SSTableReader, and the Descriptor is where we know what type of bloom filter it's using. See SSTableNamesIterator.read().

          Jonathan Ellis added a comment -

          rebased + committed

          Hudson added a comment -

          Integrated in Cassandra-0.7 #116 (See https://hudson.apache.org/hudson/job/Cassandra-0.7/116/)
          add OpenBitSet to support larger bloom filters
          patch by Ryan King, Stu Hood, and tjake for CASSANDRA-1555


            People

            • Assignee: Ryan King
            • Reporter: Stu Hood
            • Reviewer: T Jake Luciani
            • Votes: 0
            • Watchers: 6
