Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Minor
    • Resolution: Not A Problem
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: Client
    • Labels:
      None

      Description

      Internally we have a little tool that will do a rough estimate of how many rows there are in a database. It keeps getting larger and larger partitions, running scanners, until it turns up > N occupied rows. Once it has a number > N, it multiplies by the partition size to get an approximate row count.

      This issue is about generalizing this feature so it could sit in the general hbase install. It would look something like:

      long getApproximateRowCount(final Text startRow, final Text endRow, final long minimumCountPerPartition, final long maximumPartitionSize)
      

      Larger minimumCountPerPartition and maximumPartitionSize values would make the count more accurate but would mean the method ran longer.
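
      A minimal sketch of how the doubling approach above could be shaped. The SliceFunction/CountFunction callbacks stand in for the real scanner plumbing, and the initialSlices parameter is an illustrative reading of maximumPartitionSize; none of these names should be taken as the patch's API.

      /** Illustrative sketch only, not the actual patch or HBase client API. */
      public final class RowCountEstimator {

        /** Returns the key that ends the first 1/slices slice of [start, end). */
        public interface SliceFunction {
          byte[] endOfFirstSlice(byte[] start, byte[] end, long slices);
        }

        /** Counts occupied rows between two keys, e.g. by running a scanner. */
        public interface CountFunction {
          long countRows(byte[] start, byte[] end);
        }

        public static long getApproximateRowCount(byte[] startRow, byte[] endRow,
            long minimumCountPerSlice, long initialSlices,
            SliceFunction slicer, CountFunction counter) {
          // Start with a small slice (1/initialSlices of the key range) and keep
          // doubling it until it holds more than the minimum number of occupied rows.
          for (long slices = initialSlices; slices >= 1; slices /= 2) {
            byte[] sliceEnd = slicer.endOfFirstSlice(startRow, endRow, slices);
            long counted = counter.countRows(startRow, sliceEnd);
            if (counted > minimumCountPerSlice || slices == 1) {
              // Enough occupied rows seen (or the whole range was scanned):
              // extrapolate by the number of slices making up the full key range.
              return counted * slices;
            }
          }
          return 0;
        }
      }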

      Attachments

      1. 2291_v01.patch
        9 kB
        Edward J. Yoon
      2. Keying.java
        5 kB
        stack

          Activity

          Andrew Purtell added a comment -

          Other issues cover this as mentioned above. Let's retire this golden oldie.

          Andrew Purtell added a comment -

          Probably this is more apropos for HBASE-2000 and subtasks.

          stack added a comment -

          @Duane Hey. Thanks for the offer of help. Appreciated. You might want to look at coprocessors – hbase-2000 – and minitables, HBASE-2571, in particular. I believe current thinking is that aggregations, sums, etc., would be done as code loaded as coprocessors.

          Duane Moore added a comment -

          Wondering about the state of this issue. Is this a feature that could be generalized into providing other aggregation functions like min(), max(), sum(), etc.? Presumably these aggregators would work during data ingest and be specifiable per column or column-family. We are working with a proprietary NOSQL system currently that provides this functionality and it would be highly desirable in HBase. Would be interested to see if there is any active development towards this end. If not, I can look into providing a possible implementation.

          stack added a comment -

          I like this. The MR job would optionally take a table name so it could run per table rather than over all of HBase. Split would be on a line in .META. Map would read all store files, emitting stats keyed with a table + column family prefix? Reduce would sum per table column family? Postprocess could sum on a table basis?

          Could add it to our hbase MR Driver so we had more than just RowCounter.
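
          A rough skeleton of the job shape described above, showing only the keying and summing. The mapper input (a store file path plus a per-file stat such as the HFile key count) and the path parsing are assumptions, not existing HBase classes.

          import java.io.IOException;

          import org.apache.hadoop.io.LongWritable;
          import org.apache.hadoop.io.Text;
          import org.apache.hadoop.mapreduce.Mapper;
          import org.apache.hadoop.mapreduce.Reducer;

          /** Sketch only; reading the per-file stat from the store file is not shown. */
          public class TableStatsSketch {

            // Assumes the usual .../<table>/<region>/<family>/<storefile> layout.
            static String tableAndFamily(String storeFilePath) {
              String[] parts = storeFilePath.split("/");
              return parts[parts.length - 4] + "," + parts[parts.length - 2];
            }

            /** Emits each per-file stat keyed with a "table,family" prefix. */
            public static class StatsMapper
                extends Mapper<Text, LongWritable, Text, LongWritable> {
              @Override
              protected void map(Text storeFilePath, LongWritable stat, Context context)
                  throws IOException, InterruptedException {
                context.write(new Text(tableAndFamily(storeFilePath.toString())), stat);
              }
            }

            /** Sums stats per table/column family; a postprocess could sum per table. */
            public static class SumReducer
                extends Reducer<Text, LongWritable, Text, LongWritable> {
              @Override
              protected void reduce(Text key, Iterable<LongWritable> stats, Context context)
                  throws IOException, InterruptedException {
                long sum = 0;
                for (LongWritable s : stats) {
                  sum += s.get();
                }
                context.write(key, new LongWritable(sum));
              }
            }
          }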

          Jonathan Gray added a comment -

          Could also be interesting to have some kind of MR job that scanned over all storefiles and generated aggregated statistics about all your tables and regions.

          A "quick" row count is certainly useful. A detailed report that could run nightly would be very cool.

          stack added a comment -

          HFile does some of this already:

          + average key length
          + average value length
          + key count
          + entries in block and meta index
          + last key in file

          These are easy to add if we need more.

          Yeah, we should leverage it if only to read all meta in one region and then extrapolate (as you suggest).

          Jonathan Gray added a comment -

          I think we should be storing statistics about StoreFiles in the new HFile meta blocks. To start, this could be things like total row count in the storefile. Eventually we could expand that into a row index.

          These kinds of statistics could be useful all over the place.

          stack added a comment -

          Could use hbase-1183 to slice a hundredth off a region, count rows in the 1/100th, and then make an estimate of the table row count.

          Andrew Purtell added a comment -

          From: "stack"
          To: hbase-user@hadoop.apache.org
          Andrew Purtell wrote:
          > ..
          > Maybe a map of MapFile to row count estimations can be stored in the
          > FS next to the MapFiles and can be updated appropriately during
          > compactions. Then a client can iterate over the regions of a table,
          > ask the regionservers involved for row count estimations, the
          > regionservers can consult the estimation-map and send the largest
          > count found there for the table plus the largest memcache count for
          > the table, and finally the client can total all of the results.
          >
          I like this idea. Suggest sticking it in the issue. Each store already
          has an accompanying 'meta' file under the sympathetic 'info' dir. Could
          stuff estimates in here. Estimate of rows would also help sizing bloom
          filters when the 'enable-bloomfilters' switch is thrown. We'd have to
          be clear this count is an estimate, particularly when rows have sparsely
          populated columns.

          St.Ack

          Andrew Purtell added a comment -

          One possible option is to count the entries in the MapFile indexes, multiply that count by whatever hbase.io.index.interval (or the INDEX_INTERVAL HTD attribute) is, consider all of the MapFiles for the columns in a table, and choose the largest value. Do this for all of the table's regions. The result would be a reasonable estimate, but the whole process sounds expensive. Originally I was thinking that the regionservers could do this since they have to read in the MapFile indexes anyway, and also they know the count of rows in memcache, but if regionservers limit the number of in-memory MapFile indexes to avoid OOME as has been discussed, they won't have all of the information on hand.

          Maybe a map of MapFile to row count estimations can be stored in the FS next to the MapFiles and can be updated appropriately during compactions. Then a client can iterate over the regions of a table, ask the regionservers involved for row count estimations, the regionservers can consult the estimation-map and send the largest count found there for the table plus the largest memcache count for the table, and finally the client can total all of the results.
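
          A small sketch of the arithmetic described above, assuming the caller has already read each MapFile's index entry count; that plumbing is not shown, and the class and method names are illustrative.

          import java.util.List;

          /** Sketch only: estimate = index entries * hbase.io.index.interval, per MapFile. */
          public final class MapFileIndexEstimate {

            /**
             * @param indexEntriesPerRegion for each region, the index entry counts of its MapFiles
             * @param indexInterval value of hbase.io.index.interval (or the INDEX_INTERVAL HTD attribute)
             */
            public static long estimateTableRows(List<List<Long>> indexEntriesPerRegion,
                long indexInterval) {
              long total = 0;
              for (List<Long> region : indexEntriesPerRegion) {
                long largest = 0;
                // A row appears at most once per column's MapFile, so the largest
                // per-MapFile estimate is the best guess for the region's row count.
                for (long indexEntries : region) {
                  largest = Math.max(largest, indexEntries * indexInterval);
                }
                total += largest;   // sum the per-region estimates for the table total
              }
              return total;
            }
          }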

          Jim Kellerman added a comment -

          Minor issue will be addressed post 0.2

          Jim Kellerman added a comment -

          Not committed. Reopening.

          stack added a comment -

          This was committed to branch and trunk

          Bryan Duxbury added a comment -

          I was thinking that, rather than sampling and resampling repeatedly, maybe what you could do is look at the region start keys, figure out what the edit distance between start and end keys is as a proxy for the size of the region, and then scan the presumed largest and presumed smallest regions. This would give you a lower and upper bound on your table size. If your selections of smallest and largest regions happened to be bad, i.e. the counts were inverted, you can always just flip them.
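
          A sketch of this bounding idea, using the numeric gap between the first bytes of a region's start and end keys as a crude stand-in for the suggested edit distance; the Region type and the scan callback are illustrative only.

          import java.util.List;
          import java.util.function.ToLongFunction;

          /** Sketch only: scan the presumed biggest and smallest regions, extrapolate bounds. */
          public final class RowCountBounds {

            public static final class Region {
              final byte[] startKey, endKey;
              public Region(byte[] s, byte[] e) { startKey = s; endKey = e; }
            }

            // Rough proxy for how much key space a region covers: the gap between
            // the first byte of its start and end keys (empty keys mean table edges).
            static int keySpan(Region r) {
              int start = r.startKey.length == 0 ? 0 : r.startKey[0] & 0xff;
              int end = r.endKey.length == 0 ? 256 : r.endKey[0] & 0xff;
              return end - start;
            }

            /** Returns {lower, upper} bounds on table row count; assumes regions is non-empty. */
            public static long[] bound(List<Region> regions, ToLongFunction<Region> scanAndCount) {
              Region widest = regions.get(0), narrowest = regions.get(0);
              for (Region r : regions) {
                if (keySpan(r) > keySpan(widest)) widest = r;
                if (keySpan(r) < keySpan(narrowest)) narrowest = r;
              }
              // Scan only the two extreme regions and extrapolate to all regions.
              long high = scanAndCount.applyAsLong(widest) * regions.size();
              long low = scanAndCount.applyAsLong(narrowest) * regions.size();
              if (low > high) {                       // guesses inverted: flip them
                long t = low; low = high; high = t;
              }
              return new long[] { low, high };
            }
          }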

          Edward J. Yoon added a comment -

          Sorry for the delay.

          It seems difficult to me.
          I need some help.

          Jim Kellerman added a comment -

          What is the status of this issue?

          Edward J. Yoon added a comment -

          Thanks for the comments, stack.
          Now I test it with HADOOP-2480 (real log data table).
          Please wait.

          Edward J. Yoon added a comment -

          I tested 1~10 billion rows.
          The error range is too large; let me see about it.

          Now I'm checking the analyze table and compute/estimate statistics.

          stack added a comment -

          What is the state of this issue, Edward? Will it not work on billions of rows?

          Other comments on the patch are:

          + We should add an HTable.getTableDescriptor and an HTable.getColumnFamilies?
          + Comments would be helpful. For example, it would be good to explain why you all of a sudden set a variable 'i' equal to 2, and a comment confirming what you are doing finding midkeys over and over again would be helpful (won't this take a long time on a big table?).
          + If your search for endKey turns up a null, won't you get an NPE when you convert back from base64?
          + Would suggest that the estimator or an estimator override take as inputs the smallest slice to start with and the largest slice that the estimator should try.

          Edward J. Yoon added a comment - edited

          real count    estimated count
                 676                784
               17576              21058
              456976             501647
             7580124            8120647
            65201750           82648211

          (row count of sample space * sample size) was used for the calculation.
          The test results are just fine, but let me see about it.

          If you have a good idea, please let me know.

          Edward J. Yoon added a comment -

          I tried the estimator (random alphabet combinations, 1 million ~ 10 billion rows).
          The row count error range is too large.

          Edward J. Yoon added a comment -

          long getApproximateRowCount() //HTable.java

          META TABLE   first key                          last key
           |---xxx--x---xx---xxx-------xxxxxxxxxxxxxxx---x---xxxxx---x----x----|
                |             |                                        2^34 tablet's rows
                |          1/1000 th
            counting       -->|
          
                  countNum * 1000
          
          Edward J. Yoon added a comment -

          Ok, I understand exactly.

          stack added a comment -

          Edward: Here is an extract of code used internally for doing key space estimations. It won't compile because it's been hacked on to remove references to internal, unrelated packages. It was originally written by Jim a while back and then subsequently mangled by me. It might help you with this problem though it has a base64 focus (if you want an opinion, these methods, with your cleanup, might belong best in hbase/util/Keying.java).

          Edward J. Yoon added a comment -

          Ok, I understand.
          I'll try to make it.

          Edward J. Yoon added a comment -

          This method will be used for aggregate functions in hbase shell.


            People

            • Assignee:
              Unassigned
            • Reporter:
              stack
            • Votes:
              1
            • Watchers:
              5
