HBase / HBASE-6014

Support for block-granularity bitmap indexes

    Details

    • Type: New Feature
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: regionserver
    • Labels:
      None

      Description

      This came up in a discussion with Kannan today, so I promised to write something brief on JIRA – this was suggested as a potential summer intern project. The idea is as follows:

      We have several customers who periodically run full table scan MR jobs against large HBase tables while applying fairly restrictive predicates. The predicates are often reasonably simple boolean expressions across known columns, and those columns often are enum-typed or otherwise have a fairly restricted range of values. For example, a real time process may mark rows as dirty, and a background MR job may scan for dirty rows in order to perform further processing like rebuilding inverted indexes.

      One way to speed up this type of query is to add bitmap indexes. In the context of HBase, I would envision this as a new type of metadata block included in the HFile which has a series of tuples: (qualifier, value range, compressed bitmap). A 1 bit in the bitmap indicates that the corresponding HFile block has at least one cell for which a column with the given qualifier falls within the given range. Queries which have an equality or comparison predicate against an indexed qualifier can then use the bitmap index to seek directly to those blocks which may contain relevant data.
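      A minimal sketch of the lookup side of this idea, assuming one bitmap per (qualifier, value) pair and one bit per HFile block (class and method names here are hypothetical, and a real implementation would use compressed bitmaps rather than `java.util.BitSet`):

```java
import java.util.BitSet;
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch: one bit per HFile block, one bitmap per (qualifier, value).
public class BlockBitmapIndex {
    private final Map<String, BitSet> bitmaps = new HashMap<>();
    private final int numBlocks;

    public BlockBitmapIndex(int numBlocks) { this.numBlocks = numBlocks; }

    // Record that block `blockIdx` contains at least one cell where the
    // column with this qualifier has this value.
    public void mark(String qualifier, String value, int blockIdx) {
        bitmaps.computeIfAbsent(qualifier + "=" + value, k -> new BitSet(numBlocks))
               .set(blockIdx);
    }

    // Blocks that may satisfy an equality predicate; a missing entry means
    // no block in this file can match.
    public BitSet candidateBlocks(String qualifier, String value) {
        BitSet bits = bitmaps.get(qualifier + "=" + value);
        return bits == null ? new BitSet(numBlocks) : (BitSet) bits.clone();
    }

    public static void main(String[] args) {
        BlockBitmapIndex idx = new BlockBitmapIndex(4);
        idx.mark("dirty", "1", 0);   // only blocks 0 and 3 hold dirty rows
        idx.mark("dirty", "1", 3);
        BitSet hits = idx.candidateBlocks("dirty", "1");
        // A scanner would seek only to blocks 0 and 3 and skip 1 and 2.
        System.out.println(hits);    // prints {0, 3}
    }
}
```

      A predicate like `dirty = 1` then becomes a bitmap lookup followed by seeks to only the set blocks, instead of a full scan of the file.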

        Attachments

      1. bitmap-hacking.txt
        24 kB
        Todd Lipcon
      2. 6014-bitmap-hacking.txt
        24 kB
        Ted Yu

          Activity

          lemire Daniel Lemire added a comment -

          A good choice for this might be Roaring bitmaps (http://roaringbitmap.org/). They are used by Apache Spark, Druid, Apache Kylin, Apache Lucene and so forth. No patent, Apache license.

          kannanm Kannan Muthukkaruppan added a comment -

          Can't think of one that'll immediately benefit from this. So this will be low-pri for us too right now.

          tlipcon Todd Lipcon added a comment -

          Hey Kannan. You're exactly right on both cases. I'm not certain if this is a huge win yet - it's probably only helpful on some very specific workloads. So, we're not prioritizing it here. But really curious to hear if you think there are some applications at FB that would benefit.

          kannanm Kannan Muthukkaruppan added a comment -

          @Todd:

          Some early questions I have:

           1) A bit per block may not be very effective in many cases. E.g., in the "mark rows as dirty" example in your description, suppose each HFileBlock has at least one dirty KV; then no blocks will get pruned. Similarly, in many classic cases, like say state names, it is quite possible that every block contains almost every state. So the use of the feature will be limited to really narrow selectivity -- where we expect only a small % of the blocks in the file to contain the data of interest. Is this the model/use case you are targeting? [Just want to make sure.]

           2) Also, regarding <<< metadata block included in the HFile which has a series of tuples: (qualifier, value range, compressed bitmap) >>>: could you clarify what the "value range" is about? For the "enum" type use, the tuples will be "qualifier, enum for value, compressed bitmap", isn't it? And one such tuple per block for each enum, correct? Is the "value range" for cases where, say, you want to query the column value by range (e.g., say temperature)? And is the idea to slice the range of values for the column (say temperatures) into sub-ranges and have a bitmap per range, thus allowing users to do range queries by consulting the appropriate bitmaps?
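           The range-slicing idea Kannan asks about could be sketched as follows (a hypothetical toy, not part of any patch here: fixed-width bins over a numeric column's domain, one uncompressed block bitmap per bin, and a range query that ORs the overlapping bins):

```java
import java.util.BitSet;

// Hypothetical sketch of the range-binning idea: slice a numeric column's
// domain into fixed-width sub-ranges and keep one block bitmap per sub-range.
public class BinnedBitmapIndex {
    private final BitSet[] bins;      // one bitmap per value sub-range
    private final double lo, binWidth;

    public BinnedBitmapIndex(double lo, double hi, int numBins, int numBlocks) {
        this.lo = lo;
        this.binWidth = (hi - lo) / numBins;
        this.bins = new BitSet[numBins];
        for (int i = 0; i < numBins; i++) bins[i] = new BitSet(numBlocks);
    }

    private int binOf(double v) {
        int b = (int) ((v - lo) / binWidth);
        return Math.max(0, Math.min(bins.length - 1, b));
    }

    // Record that block `blockIdx` contains a cell with this value.
    public void mark(double value, int blockIdx) {
        bins[binOf(value)].set(blockIdx);
    }

    // Blocks that may contain a value in [from, to]: OR the overlapping bins.
    public BitSet candidateBlocks(double from, double to) {
        BitSet out = new BitSet();
        for (int b = binOf(from); b <= binOf(to); b++) out.or(bins[b]);
        return out;
    }

    public static void main(String[] args) {
        // Temperatures 0..100 sliced into 10 bins; 4 blocks in the file.
        BinnedBitmapIndex idx = new BinnedBitmapIndex(0, 100, 10, 4);
        idx.mark(12.0, 0);  // block 0 has a temperature in bin [10,20)
        idx.mark(55.0, 2);  // block 2 has one in bin [50,60)
        idx.mark(95.0, 3);
        System.out.println(idx.candidateBlocks(50, 70)); // prints {2}
    }
}
```

           Binning makes range predicates answerable from the same per-block bitmaps, at the cost of some false positives at the bin boundaries.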

          zhihyu@ebaysf.com Ted Yu added a comment -

           I didn't rebase Todd's patch for trunk because the pom.xml structure changed.

          I moved TestByteArrayCuckooMap from src/main to src/test

          tlipcon Todd Lipcon added a comment -

          Oh, I also did some hacking on the airplane: this WIP patch adds:

           • a cuckoo hashtable implementation which uses byte array ranges as keys. I needed this to avoid having to do copies into byte[] for use with a normal TreeMap. It could have been a normal separate-chaining hashtable, but I've been wanting to implement a cuckoo hashmap for a while
           • an import of JavaEWAH, a compressed bitmap index library
           • hooks into the HFile write path to write out bitmap index blocks with the HFile

           Obviously far from anything useful, but I'm posting it here in case anyone's interested in running with it.
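           For reference, the cuckoo hashing idea above reduces to something like this minimal toy (a hypothetical set over int keys, not the patch's byte-array-range implementation: two tables, two hash functions, and inserts that displace an occupant to its alternate slot):

```java
import java.util.Arrays;

// Hypothetical minimal cuckoo hash set over int keys: every key lives in one
// of two slots (t1[h1(k)] or t2[h2(k)]), so lookups probe at most two places.
public class CuckooSet {
    private static final int EMPTY = Integer.MIN_VALUE; // sentinel; keys must differ
    private final int[] t1, t2;

    public CuckooSet(int capacity) {
        t1 = new int[capacity]; t2 = new int[capacity];
        Arrays.fill(t1, EMPTY); Arrays.fill(t2, EMPTY);
    }

    private int h1(int k) { return Math.floorMod(k * 0x9E3779B9, t1.length); }
    private int h2(int k) { return Math.floorMod(Integer.rotateLeft(k, 16) * 0x85EBCA6B, t2.length); }

    public boolean contains(int k) {
        return t1[h1(k)] == k || t2[h2(k)] == k;
    }

    public void insert(int k) {
        if (contains(k)) return;
        for (int kicks = 0; kicks < 32; kicks++) {
            int i = h1(k);
            int evicted = t1[i]; t1[i] = k;   // claim the slot in table 1
            if (evicted == EMPTY) return;
            k = evicted;                      // kick the occupant to table 2
            int j = h2(k);
            evicted = t2[j]; t2[j] = k;
            if (evicted == EMPTY) return;
            k = evicted;                      // ...and back toward table 1
        }
        throw new IllegalStateException("cycle: a real impl would rehash/grow here");
    }

    public static void main(String[] args) {
        CuckooSet s = new CuckooSet(64);
        for (int k = 1; k <= 20; k++) s.insert(k);
        System.out.println(s.contains(7) && !s.contains(99)); // prints true
    }
}
```

           The appeal over separate chaining is the worst-case constant-time lookup: membership is decided by at most two array probes.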

          tlipcon Todd Lipcon added a comment -

           I had some airplane time last week so I spent some of it thinking about how to go about this. It's not quite as straightforward in HBase as it is in your standard RDBMS – at least on the read path:

          This is because a given row may actually span multiple "layers" of the stack of storefiles. For example, for a query "col_a = 1 AND col_b = 1", we may have the following files in a region:

           HFile 1 (newer): row1: col_a="1", no value for col_b
           HFile 2 (older): row1: col_a="2", col_b="1"
           (So the "current" merged version of the row has col_a=1 and col_b=1.)

          So, if we naively apply the bitmap index to each HFile in turn, we would end up excluding the block in both, and not see the correct result.

          Instead, I think we have to apply each predicate to each HFile in turn to come up with a set of ranges:

          Predicate col_a = 1, HFile 1: output key range for block containing row1
          Predicate col_a = 1, HFile 2: no output

          Predicate col_b = 1, HFile 1: no output
           Predicate col_b = 1, HFile 2: output key range for block containing row1

          For each predicate, we take the union of the key ranges across HFiles. Then, we take the intersection across predicates, to come up with a total set of applicable key ranges. Then, we can push this key range list down into the scanner stack to provide skip-ahead hints in the filter.
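           The union/intersection step above can be sketched as follows (a hypothetical toy using [start, end] intervals over long "row keys"; real key ranges would be byte-array row keys):

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

// Hypothetical sketch of the range algebra: per predicate, union the key
// ranges reported by each HFile's bitmap index; then intersect across
// predicates to get the ranges the scanner stack may actually need to visit.
public class RangeAlgebra {
    // Merge overlapping/adjacent ranges after concatenating per-file results.
    static List<long[]> union(List<long[]> ranges) {
        List<long[]> sorted = new ArrayList<>(ranges);
        sorted.sort(Comparator.comparingLong(r -> r[0]));
        List<long[]> out = new ArrayList<>();
        for (long[] r : sorted) {
            if (!out.isEmpty() && r[0] <= out.get(out.size() - 1)[1] + 1) {
                out.get(out.size() - 1)[1] = Math.max(out.get(out.size() - 1)[1], r[1]);
            } else {
                out.add(new long[] { r[0], r[1] });
            }
        }
        return out;
    }

    // Intersect two merged range lists with a two-pointer sweep.
    static List<long[]> intersect(List<long[]> a, List<long[]> b) {
        List<long[]> out = new ArrayList<>();
        int i = 0, j = 0;
        while (i < a.size() && j < b.size()) {
            long lo = Math.max(a.get(i)[0], b.get(j)[0]);
            long hi = Math.min(a.get(i)[1], b.get(j)[1]);
            if (lo <= hi) out.add(new long[] { lo, hi });
            if (a.get(i)[1] < b.get(j)[1]) i++; else j++;
        }
        return out;
    }

    public static void main(String[] args) {
        // Predicate col_a = 1: HFile 1 reports [100,199], HFile 2 nothing.
        List<long[]> colA = union(List.of(new long[] { 100, 199 }));
        // Predicate col_b = 1: HFile 1 nothing, HFile 2 reports [150,249].
        List<long[]> colB = union(List.of(new long[] { 150, 249 }));
        for (long[] r : intersect(colA, colB))
            System.out.println(r[0] + ".." + r[1]); // prints 150..199
    }
}
```

           The key property is that the union happens per predicate before the intersection, so a row whose matching cells are split across storefile layers is never incorrectly pruned.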

          Any interns out there interested in this?


             People

             • Assignee: Unassigned
             • Reporter: tlipcon Todd Lipcon
             • Votes: 0
             • Watchers: 18
