Uploaded image for project: 'Phoenix'
  1. Phoenix
  2. PHOENIX-3448

Possible deadlock because transaction service client is stuck

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • None
    • None
    • None

    Description

      I have noticed on my local test run that sometimes IndexExtendedIT isn't completing. On taking a thread dump, I am seeing two threads waiting for the lock (waiting to lock <0x0000000788e69f50>) as follows:

      "StoreFileOpenerThread-0-1" daemon prio=5 tid=0x00007fc796e43800 nid=0x2631f waiting for monitor entry [0x0000000128f62000]
         java.lang.Thread.State: BLOCKED (on object monitor)
      	at org.apache.phoenix.query.ConnectionQueryServicesImpl$13.call(ConnectionQueryServicesImpl.java:2369)
      	- waiting to lock <0x0000000788e69f50> (a org.apache.phoenix.query.ConnectionQueryServicesImpl)
      	at org.apache.phoenix.query.ConnectionQueryServicesImpl$13.call(ConnectionQueryServicesImpl.java:2358)
      	at org.apache.phoenix.util.PhoenixContextExecutor.call(PhoenixContextExecutor.java:76)
      	at org.apache.phoenix.query.ConnectionQueryServicesImpl.init(ConnectionQueryServicesImpl.java:2358)
      	at org.apache.phoenix.jdbc.PhoenixDriver.getConnectionQueryServices(PhoenixDriver.java:232)
      	at org.apache.phoenix.jdbc.PhoenixEmbeddedDriver.createConnection(PhoenixEmbeddedDriver.java:147)
      	at org.apache.phoenix.jdbc.PhoenixDriver.connect(PhoenixDriver.java:202)
      	at java.sql.DriverManager.getConnection(DriverManager.java:571)
      	at java.sql.DriverManager.getConnection(DriverManager.java:187)
      	at org.apache.phoenix.util.QueryUtil.getConnection(QueryUtil.java:341)
      	at org.apache.phoenix.util.QueryUtil.getConnectionOnServer(QueryUtil.java:328)
      	at org.apache.phoenix.util.QueryUtil.getConnectionOnServer(QueryUtil.java:318)
      	at org.apache.hadoop.hbase.regionserver.IndexHalfStoreFileReaderGenerator.preStoreFileReaderOpen(IndexHalfStoreFileReaderGenerator.java:144)
      	at org.apache.hadoop.hbase.regionserver.RegionCoprocessorHost$64.call(RegionCoprocessorHost.java:1545)
      	at org.apache.hadoop.hbase.regionserver.RegionCoprocessorHost$RegionOperation.call(RegionCoprocessorHost.java:1656)
      	at org.apache.hadoop.hbase.regionserver.RegionCoprocessorHost.execOperation(RegionCoprocessorHost.java:1733)
      	at org.apache.hadoop.hbase.regionserver.RegionCoprocessorHost.execOperationWithResult(RegionCoprocessorHost.java:1695)
      	at org.apache.hadoop.hbase.regionserver.RegionCoprocessorHost.preStoreFileReaderOpen(RegionCoprocessorHost.java:1540)
      	at org.apache.hadoop.hbase.regionserver.StoreFileInfo.open(StoreFileInfo.java:205)
      	at org.apache.hadoop.hbase.regionserver.StoreFile.open(StoreFile.java:380)
      	at org.apache.hadoop.hbase.regionserver.StoreFile.createReader(StoreFile.java:481)
      	at org.apache.hadoop.hbase.regionserver.StoreFile.createReader(StoreFile.java:471)
      	at org.apache.hadoop.hbase.regionserver.HStore.createStoreFileAndReader(HStore.java:571)
      	at org.apache.hadoop.hbase.regionserver.HStore.access$000(HStore.java:123)
      	at org.apache.hadoop.hbase.regionserver.HStore$1.call(HStore.java:513)
      	at org.apache.hadoop.hbase.regionserver.HStore$1.call(HStore.java:510)
      	at java.util.concurrent.FutureTask.run(FutureTask.java:262)
      	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
      	at java.util.concurrent.FutureTask.run(FutureTask.java:262)
      	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
      	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
      	at java.lang.Thread.run(Thread.java:745)
      

      The thread which has the lock is stuck because starting the transaction client service hasn't returned. The stacktrace for that thread looks like this:

      "StoreFileOpenerThread-0-1" daemon prio=5 tid=0x00007fc797088000 nid=0x29717 waiting on condition [0x00000001295e1000]
         java.lang.Thread.State: WAITING (parking)
      	at sun.misc.Unsafe.park(Native Method)
      	- parking to wait for  <0x0000000788f1bb58> (a com.google.common.util.concurrent.AbstractFuture$Sync)
      	at java.util.concurrent.locks.LockSupport.park(LockSupport.java:186)
      	at java.util.concurrent.locks.AbstractQueuedSynchronizer.parkAndCheckInterrupt(AbstractQueuedSynchronizer.java:834)
      	at java.util.concurrent.locks.AbstractQueuedSynchronizer.doAcquireSharedInterruptibly(AbstractQueuedSynchronizer.java:994)
      	at java.util.concurrent.locks.AbstractQueuedSynchronizer.acquireSharedInterruptibly(AbstractQueuedSynchronizer.java:1303)
      	at com.google.common.util.concurrent.AbstractFuture$Sync.get(AbstractFuture.java:280)
      	at com.google.common.util.concurrent.AbstractFuture.get(AbstractFuture.java:116)
      	at com.google.common.util.concurrent.Uninterruptibles.getUninterruptibly(Uninterruptibles.java:132)
      	at com.google.common.util.concurrent.Futures.getUnchecked(Futures.java:999)
      	at org.apache.twill.zookeeper.ForwardingZKClientService.startAndWait(ForwardingZKClientService.java:51)
      	at org.apache.phoenix.query.ConnectionQueryServicesImpl.initTxServiceClient(ConnectionQueryServicesImpl.java:384)
      	at org.apache.phoenix.query.ConnectionQueryServicesImpl.openConnection(ConnectionQueryServicesImpl.java:398)
      	at org.apache.phoenix.query.ConnectionQueryServicesImpl.access$300(ConnectionQueryServicesImpl.java:234)
      	at org.apache.phoenix.query.ConnectionQueryServicesImpl$13.call(ConnectionQueryServicesImpl.java:2380)
      	- locked <0x0000000788e69f50> (a org.apache.phoenix.query.ConnectionQueryServicesImpl)
      	at org.apache.phoenix.query.ConnectionQueryServicesImpl$13.call(ConnectionQueryServicesImpl.java:2358)
      	at org.apache.phoenix.util.PhoenixContextExecutor.call(PhoenixContextExecutor.java:76)
      	at org.apache.phoenix.query.ConnectionQueryServicesImpl.init(ConnectionQueryServicesImpl.java:2358)
      	at org.apache.phoenix.jdbc.PhoenixDriver.getConnectionQueryServices(PhoenixDriver.java:232)
      	at org.apache.phoenix.jdbc.PhoenixEmbeddedDriver.createConnection(PhoenixEmbeddedDriver.java:147)
      	at org.apache.phoenix.jdbc.PhoenixDriver.connect(PhoenixDriver.java:202)
      	at java.sql.DriverManager.getConnection(DriverManager.java:571)
      	at java.sql.DriverManager.getConnection(DriverManager.java:187)
      	at org.apache.phoenix.util.QueryUtil.getConnection(QueryUtil.java:341)
      	at org.apache.phoenix.util.QueryUtil.getConnectionOnServer(QueryUtil.java:328)
      	at org.apache.phoenix.util.QueryUtil.getConnectionOnServer(QueryUtil.java:318)
      	at org.apache.hadoop.hbase.regionserver.IndexHalfStoreFileReaderGenerator.preStoreFileReaderOpen(IndexHalfStoreFileReaderGenerator.java:144)
      	at org.apache.hadoop.hbase.regionserver.RegionCoprocessorHost$64.call(RegionCoprocessorHost.java:1545)
      	at org.apache.hadoop.hbase.regionserver.RegionCoprocessorHost$RegionOperation.call(RegionCoprocessorHost.java:1656)
      	at org.apache.hadoop.hbase.regionserver.RegionCoprocessorHost.execOperation(RegionCoprocessorHost.java:1733)
      	at org.apache.hadoop.hbase.regionserver.RegionCoprocessorHost.execOperationWithResult(RegionCoprocessorHost.java:1695)
      	at org.apache.hadoop.hbase.regionserver.RegionCoprocessorHost.preStoreFileReaderOpen(RegionCoprocessorHost.java:1540)
      	at org.apache.hadoop.hbase.regionserver.StoreFileInfo.open(StoreFileInfo.java:205)
      	at org.apache.hadoop.hbase.regionserver.StoreFile.open(StoreFile.java:380)
      	at org.apache.hadoop.hbase.regionserver.StoreFile.createReader(StoreFile.java:481)
      	at org.apache.hadoop.hbase.regionserver.StoreFile.createReader(StoreFile.java:471)
      	at org.apache.hadoop.hbase.regionserver.HStore.createStoreFileAndReader(HStore.java:571)
      	at org.apache.hadoop.hbase.regionserver.HStore.access$000(HStore.java:123)
      	at org.apache.hadoop.hbase.regionserver.HStore$1.call(HStore.java:513)
      	at org.apache.hadoop.hbase.regionserver.HStore$1.call(HStore.java:510)
      	at java.util.concurrent.FutureTask.run(FutureTask.java:262)
      	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
      	at java.util.concurrent.FutureTask.run(FutureTask.java:262)
      	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
      	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
      	at java.lang.Thread.run(Thread.java:745)
      

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              samarthjain Samarth Jain
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: