Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-24298

Reduce cpu load of locating region especially in batch mode.

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Patch Available
    • Major
    • Resolution: Unresolved
    • 1.2.12
    • None
    • Performance
    • None

    Description

      Binary search is used to speedup the process of locating region. It is already fast enough, while cpu of HBASE client becomes the bottleneck when doing TCSB benchmark. We can make the process of locating region faster to reduce cpu load in some special cases , which however is our common case in production environment.  It is the case: 

              1. Predefined splits in uniform distribution.
       
              2. Load data in batch mode.

      The optimization is very simple, just to contract range of binary search. Initially,  record all startIndex and endIndex of first or two bytes of keys. When a region key comes, find the contracted startIndex and endIndex of the key. Then return to normal binary search process with the specified startIndex and endIndex. 

      Then we can ideally reduce cpu to 1/8 with 1 byte or 1/16 with 2 bytes.

      Attachments

        1. locating region.svg
          103 kB
          star
        2. HBASE-24298.patch
          28 kB
          star

        Activity

          People

            starphin star
            starphin star
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated: