Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-32

[hbase] Add row count estimator

    Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Minor
    • Resolution: Not A Problem
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: Client
    • Labels:
      None

      Description

      Internally we have a little tool that will do a rough estimate of how many rows there are in a dataHbase. It keeps getting larger and larger partitions running scanners until it turns up > N occupied rows. Once it has a number > N, it multiples by the partition size to get an approximate row count.

      This issue is about generalizing this feature so it could sit in the general hbase install. It would look something like:

      long getApproximateRowCount(final Text startRow, final Text endRow, final long minimumCountPerPartition, final long maximumPartitionSize)
      

      Larger minimumCountPerPartition and maximumPartitionSize values would make the count more accurate but would mean the method ran longer.

        Attachments

        1. 2291_v01.patch
          9 kB
          Edward J. Yoon
        2. Keying.java
          5 kB
          stack

          Issue Links

            Activity

              People

              • Assignee:
                Unassigned
                Reporter:
                stack stack
              • Votes:
                1 Vote for this issue
                Watchers:
                5 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: