Details

    • Type: Sub-task
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 2.0.0-beta-2, 2.0.0
    • Component/s: wal
    • Labels: None
    • Hadoop Flags: Reviewed

    Description

      TestFullLogReconstruction fails pretty frequently in hadoopqa builds.

      A recent run hung in org.apache.hadoop.hbase.TestFullLogReconstruction.tearDownAfterClass(TestFullLogReconstruction.java:68). See the test report:

      https://builds.apache.org/job/PreCommit-HBASE-Build/11363/testReport/org.apache.hadoop.hbase/TestFullLogReconstruction/org_apache_hadoop_hbase_TestFullLogReconstruction/

      The thread dump from that run shows the meta region close stuck waiting on a WAL sync:

      Thread 1250 (RS_CLOSE_META-edd281aedb18:59863-0):
      State: TIMED_WAITING
      Blocked count: 92
      Waited count: 278
      Stack:
      java.lang.Object.wait(Native Method)
      org.apache.hadoop.hbase.regionserver.wal.SyncFuture.get(SyncFuture.java:133)
      org.apache.hadoop.hbase.regionserver.wal.AbstractFSWAL.blockOnSync(AbstractFSWAL.java:718)
      org.apache.hadoop.hbase.regionserver.wal.AsyncFSWAL.sync(AsyncFSWAL.java:605)
      org.apache.hadoop.hbase.regionserver.wal.WALUtil.doFullAppendTransaction(WALUtil.java:154)
      org.apache.hadoop.hbase.regionserver.wal.WALUtil.writeFlushMarker(WALUtil.java:81)
      org.apache.hadoop.hbase.regionserver.HRegion.internalFlushCacheAndCommit(HRegion.java:2645)
      org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:2356)
      org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:2328)
      org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:2319)
      org.apache.hadoop.hbase.regionserver.HRegion.doClose(HRegion.java:1531)
      org.apache.hadoop.hbase.regionserver.HRegion.close(HRegion.java:1437)
      org.apache.hadoop.hbase.regionserver.handler.CloseRegionHandler.process(CloseRegionHandler.java:104)
      org.apache.hadoop.hbase.executor.EventHandler.run(EventHandler.java:104)
      java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
      java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
      java.lang.Thread.run(Thread.java:748)

      Did we miss a signal? Do we need to interrupt the waiting thread? The hadoopqa builds do not keep the full log, so it is hard to see everything that is going on. This test is not in the flaky set either.
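
      To make the "missed signal" question concrete, here is a minimal sketch of a guarded timed wait. It is not the HBase SyncFuture code; SketchSyncFuture, complete() and get() are hypothetical names used only for illustration. The point it shows: if the waiter re-checks a flag under the same lock that the completer sets it under, a notify that arrives before the wait starts is not lost, and a completion that never arrives surfaces as a timeout instead of an indefinite hang.

```java
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

/** Hypothetical stand-in for a WAL sync future; NOT the HBase SyncFuture class. */
final class SketchSyncFuture {
    private boolean done;     // guarded by "this"
    private Throwable error;  // guarded by "this"

    /** Called by the (hypothetical) sync thread once the sync has finished. */
    synchronized void complete(Throwable t) {
        done = true;
        error = t;
        notifyAll();  // wake every waiter; each one re-checks "done"
    }

    /** Blocks until complete() has been called or timeoutMs has elapsed. */
    synchronized void get(long timeoutMs) throws InterruptedException, TimeoutException {
        final long deadline = System.nanoTime() + TimeUnit.MILLISECONDS.toNanos(timeoutMs);
        // Loop on the flag: handles spurious wakeups and a notify that
        // happened before this thread started waiting.
        while (!done) {
            long remainingNs = deadline - System.nanoTime();
            if (remainingNs <= 0) {
                throw new TimeoutException("sync not completed within " + timeoutMs + " ms");
            }
            // wait(0) would wait forever, so round up to at least 1 ms.
            wait(Math.max(1L, TimeUnit.NANOSECONDS.toMillis(remainingNs)));
        }
        if (error != null) {
            throw new RuntimeException("sync failed", error);
        }
    }
}

/** Small driver: one thread completes the future while main waits on it. */
final class SyncFutureSketchDemo {
    public static void main(String[] args) throws Exception {
        SketchSyncFuture future = new SketchSyncFuture();
        Thread syncer = new Thread(() -> future.complete(null), "sketch-syncer");
        syncer.start();
        future.get(5_000);
        syncer.join();
        System.out.println("sync completed");
    }
}
```

      In this sketch a lost completion shows up as a TimeoutException the caller can log or act on. Whether the real hang comes from a missed completion or from something else cannot be told from the thread dump alone.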

      Attachments

        1. HBASE-19927.patch (5 kB, Duo Zhang)
        2. HBASE-19927-addendum.patch (3 kB, Duo Zhang)
        3. js (171 kB, Duo Zhang)
        4. out (2.65 MB, Duo Zhang)

            People

              Assignee: Duo Zhang (zhangduo)
              Reporter: Michael Stack (stack)