[HBASE-8806] Row locks are acquired repeatedly in HRegion.doMiniBatchMutation for duplicate rows. - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Closed
Priority: Critical
Resolution: Fixed
Affects Version/s: 0.94.5
Fix Version/s: 0.94.10
Component/s: regionserver
Labels:
None

Description

If we already have the lock in the doMiniBatchMutation we don't need to re-acquire it. The solution would be to keep a cache of the rowKeys already locked for a miniBatchMutation and If we already have the
rowKey in the cache, we don't repeatedly try and acquire the lock. A fix to this problem would be to keep a set of rows we already locked and not try to acquire the lock for these rows.

We have tested this fix in our production environment and has improved replication performance quite a bit. We saw a replication batch go from 3+ minutes to less than 10 seconds for batches with duplicate row keys.

static final int ACQUIRE_LOCK_COUNT = 0;

  @Test
  public void testRedundantRowKeys() throws Exception {

    final int batchSize = 100000;
    
    String tableName = getClass().getSimpleName();
    Configuration conf = HBaseConfiguration.create();
    conf.setClass(HConstants.REGION_IMPL, MockHRegion.class, HeapSize.class);
    MockHRegion region = (MockHRegion) TestHRegion.initHRegion(Bytes.toBytes(tableName), tableName, conf, Bytes.toBytes("a"));

    List<Pair<Mutation, Integer>> someBatch = Lists.newArrayList();
    int i = 0;
    while (i < batchSize) {
      if (i % 2 == 0) {
        someBatch.add(new Pair<Mutation, Integer>(new Put(Bytes.toBytes(0)), null));
      } else {
        someBatch.add(new Pair<Mutation, Integer>(new Put(Bytes.toBytes(1)), null));
      }
      i++;
    }
    long startTime = System.currentTimeMillis();
    region.batchMutate(someBatch.toArray(new Pair[0]));
    long endTime = System.currentTimeMillis();
    long duration = endTime - startTime;
    System.out.println("duration: " + duration + " ms");
    assertEquals(2, ACQUIRE_LOCK_COUNT);
  }

  @Override
  public Integer getLock(Integer lockid, byte[] row, boolean waitForLock) throws IOException {
    ACQUIRE_LOCK_COUNT++;
    return super.getLock(lockid, row, waitForLock);
  }

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

8806-0.94-v4.txt
04/Jul/13 08:46
5 kB
Lars Hofhansl
8806-0.94-v5.txt
04/Jul/13 08:53
5 kB
Lars Hofhansl
8806-0.94-v6.txt
04/Jul/13 16:48
5 kB
Lars Hofhansl
8806-0.94-v7.txt
16/Jul/13 21:41
11 kB
Lars Hofhansl
HBASE-8806.patch
03/Jul/13 09:24
3 kB
Anoop Sam John
HBASE-8806-0.94.10.patch
26/Jun/13 00:13
8 kB
churro morales
HBASE-8806-0.94.10-v2.patch
26/Jun/13 00:29
8 kB
churro morales
HBASE-8806-0.94.10-v3.patch
04/Jul/13 01:36
14 kB
churro morales
hbase-8806-0.94-microbenchmark-no-dupe-rows.txt
15/Jul/13 16:25
5 kB
Dave Latham
hbase-8806-0.94-microbenchmarks-dupe-rows.txt
15/Jul/13 16:25
6 kB
Dave Latham
HBASE-8806-threadBasedRowLocks.patch
03/Jul/13 14:54
44 kB
Dave Latham
HBASE-8806-threadBasedRowLocks-v2.patch
04/Jul/13 15:58
43 kB
Dave Latham
row_lock_perf_results.txt
10/Jul/13 20:13
6 kB
churro morales

Issue Links

is related to

HBASE-8877 Reentrant row locks

Closed

Activity

People

Assignee:: churro morales

Reporter:: churro morales

Votes:: 0 Vote for this issue

Watchers:: 12 Start watching this issue

Dates

Created:: 25/Jun/13 19:49

Updated:: 24/Jul/13 19:08

Resolved:: 16/Jul/13 21:43