HBase / HBASE-3610

Improve RegionSplitter Performance


Details

    • Type: Improvement
    • Status: Closed
    • Priority: Trivial
    • Resolution: Fixed
    • Fix Version/s: 0.90.2, 0.92.0

    Description

      When running RegionSplitter on a 100-node cluster with 900 regions (and plenty of data), the utility took around 72 hours to run. Analysis revealed two major bottlenecks:

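      1. Logical splits are serialized: the utility waits for each split request to be registered before issuing the next one. Parallelizing this step will bring the actual number of outstanding splits in line with the configured number.
      2. Outstanding splits are modeled as a queue, so a split that finishes early cannot be handled until the splits ahead of it finish. Changing this to a list that is scanned for finished splits will allow handling splits that finish out of order.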
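
The effect described in item 2 can be illustrated with a small simulation. This is only a sketch under stated assumptions, not the RegionSplitter code: the class `OutstandingSplitsSketch`, the method `ticksToFinish`, and the tick-based timing model are all hypothetical, and no HBase API is used. Each split is represented only by the tick at which it finishes, and at most `maxOutstanding` splits may be in flight.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical simulation; none of these names come from HBase.
public class OutstandingSplitsSketch {

    /**
     * Returns the number of ticks needed to run all splits.
     * durations[i] is how many ticks split i takes once submitted.
     * scanWholeList=false models a queue (only the oldest split can be retired);
     * scanWholeList=true models a list scanned for any finished split.
     */
    public static int ticksToFinish(int[] durations, int maxOutstanding, boolean scanWholeList) {
        if (maxOutstanding < 1) {
            throw new IllegalArgumentException("maxOutstanding must be at least 1");
        }
        List<Integer> outstanding = new ArrayList<>(); // finish ticks, in submission order
        int next = 0; // index of the next split to submit
        int now = 0;  // current tick
        while (next < durations.length || !outstanding.isEmpty()) {
            if (scanWholeList) {
                // Retire every finished split, wherever it sits in the list.
                final int t = now;
                outstanding.removeIf(finish -> finish <= t);
            } else {
                // Retire only from the head: a slow head blocks finished splits behind it.
                while (!outstanding.isEmpty() && outstanding.get(0) <= now) {
                    outstanding.remove(0);
                }
            }
            // Refill the window up to the configured number of outstanding splits.
            while (next < durations.length && outstanding.size() < maxOutstanding) {
                outstanding.add(now + durations[next]);
                next++;
            }
            if (!outstanding.isEmpty()) {
                now++;
            }
        }
        return now;
    }

    public static void main(String[] args) {
        int[] durations = {5, 1, 1, 1}; // one slow split followed by three fast ones
        System.out.println("queue: " + ticksToFinish(durations, 2, false));
        System.out.println("list:  " + ticksToFinish(durations, 2, true));
    }
}
```

With one slow split at the head and a window of two, the queue model needs 6 ticks and the scanned list needs 5, because the fast splits no longer wait behind the slow one. The gap grows with the number of regions and the variance in split times.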


            People

              nspiegelberg Nicolas Spiegelberg
