HBase / HBASE-12853

distributed write pattern to replace ad hoc 'salting'


Details

    • Type: New Feature
    • Status: Closed
    • Priority: Major
    • Resolution: Invalid

    Description

      In reviewing HBASE-11682 (Description of Hot Spotting), one of the issues raised is that while 'salting' alleviates region hot spotting, it increases the complexity of using the data.

      Through the use of coprocessors, it should be possible to offer a method that distributes the data across the cluster on write and then manages reads so that they return a sorted result set, abstracting away the underlying process.

      On table creation, a flag is set to indicate that this is a parallel table.

      On insert into the table, if the flag is set to true, a prefix is added to the key, e.g. <region server #>- or <region server #>||, where the region server # is an integer between 1 and the number of region servers defined.
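
      The write-side prefixing described above might be sketched as follows. This is only an illustration in plain Python, not HBase or coprocessor code: the function name salted_key, the "-" separator default, and the use of a CRC32 hash of the key to pick the bucket are all assumptions, since the proposal does not say how the region server number is chosen.

```python
import zlib


def salted_key(row_key: str, num_buckets: int, sep: str = "-") -> str:
    """Return row_key prefixed with a bucket number in 1..num_buckets.

    Hypothetical helper: the bucket is derived from a CRC32 hash of the
    key (an assumption) so the same key always gets the same prefix.
    """
    if num_buckets < 1:
        raise ValueError("num_buckets must be at least 1")
    # CRC32 is deterministic across processes, unlike Python's built-in hash().
    bucket = zlib.crc32(row_key.encode("utf-8")) % num_buckets + 1
    return f"{bucket}{sep}{row_key}"
```

      A deterministic choice such as a hash keeps point lookups possible, because the reader can recompute the prefix from the original key.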

      On read (scan), a separate scan is created for each region server defined, with the corresponding prefix added. Since each scan returns its rows in sort order, it's possible to strip the prefix and repeatedly return the lowest-valued key among the subsets, which yields a fully sorted result set.
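
      The read side amounts to a k-way merge of already sorted streams. A minimal sketch, again in plain Python rather than HBase code: the table is modelled as a sorted list of prefixed keys, and bucket_scan and merged_scan are hypothetical names used only for illustration.

```python
import heapq
from typing import Iterator, List


def bucket_scan(sorted_keys: List[str], bucket: int, sep: str = "-") -> Iterator[str]:
    """Simulate one prefixed scan: yield the keys of one bucket, prefix stripped."""
    prefix = f"{bucket}{sep}"
    for key in sorted_keys:
        if key.startswith(prefix):
            # Keys sharing a prefix are already ordered by their remainder.
            yield key[len(prefix):]


def merged_scan(sorted_keys: List[str], num_buckets: int, sep: str = "-") -> Iterator[str]:
    """Merge one scan per bucket, always emitting the lowest remaining key."""
    scans = [bucket_scan(sorted_keys, b, sep) for b in range(1, num_buckets + 1)]
    return heapq.merge(*scans)


# Stand-in for a table whose row keys are stored in sorted order.
table = sorted(["2-b", "1-a", "1-d", "3-c", "2-e"])
print(list(merged_scan(table, 3)))  # ['a', 'b', 'c', 'd', 'e']
```

      The merge holds only one pending key per scan at a time, so the client sees a single ordered stream without buffering whole result sets.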

          People

            Assignee: Unassigned
            Reporter: Michael Segel (msegel)
            Votes: 1
            Watchers: 12
