HBASE-7871

HBase can be stuck when closing regions concurrently


Details

    • Type: Bug
    • Status: Closed
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: 0.95.2
    • Fix Version/s: 0.98.0, 0.95.1
    • Component/s: master
    • Labels: None
    • Hadoop Flags: Reviewed

    Description

      The attached test fails about 1% of the time on 0.96. It does not seem to fail on 0.94.5. The test is simple: it creates a table and performs some puts.

      I have attached the stack dump. The logs do not appear to show anything useful.
      The suspicious part is:

      "RS_CLOSE_REGION-localhost,57575,1361197489166-2" prio=10 tid=0x00007fb0c8775800 nid=0x61ac runnable [0x00007fb09f272000]
         java.lang.Thread.State: RUNNABLE
              at java.util.TreeMap.fixAfterDeletion(TreeMap.java:2193)
              at java.util.TreeMap.deleteEntry(TreeMap.java:2151)
              at java.util.TreeMap.remove(TreeMap.java:585)
              at java.util.TreeSet.remove(TreeSet.java:259)
              at org.apache.hadoop.hbase.regionserver.MetricsRegionAggregateSourceImpl.deregister(MetricsRegionAggregateSourceImpl.java:55)
              at org.apache.hadoop.hbase.regionserver.MetricsRegionSourceImpl.close(MetricsRegionSourceImpl.java:86)
              at org.apache.hadoop.hbase.regionserver.MetricsRegion.close(MetricsRegion.java:40)
              at org.apache.hadoop.hbase.regionserver.HRegion.doClose(HRegion.java:1063)
              at org.apache.hadoop.hbase.regionserver.HRegion.close(HRegion.java:969)
              - locked <0x00000006944e2558> (a java.lang.Object)
              at org.apache.hadoop.hbase.regionserver.handler.CloseRegionHandler.process(CloseRegionHandler.java:146)
              at org.apache.hadoop.hbase.executor.EventHandler.run(EventHandler.java:203)
              at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
              at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
              at java.lang.Thread.run(Thread.java:662)
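
      One plausible reading of the stack above (an assumption, not something stated in this report): java.util.TreeSet is not thread-safe, and several RS_CLOSE_REGION handlers removing entries from a shared set without synchronization could corrupt the underlying tree so that a thread spins inside TreeMap.fixAfterDeletion. A minimal sketch of guarding such a set with a lock follows. GuardedRegistry, register, deregister and closeConcurrently are hypothetical names used for illustration only; they are not HBase classes, and this is not necessarily how the attached patches address the problem.

```java
import java.util.TreeSet;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.locks.ReentrantReadWriteLock;

// Hypothetical stand-in for an aggregate that tracks per-region sources.
public class GuardedRegistry {
    private final TreeSet<String> sources = new TreeSet<>();
    private final ReentrantReadWriteLock lock = new ReentrantReadWriteLock();

    public void register(String name) {
        lock.writeLock().lock();
        try {
            sources.add(name);
        } finally {
            lock.writeLock().unlock();
        }
    }

    // Every structural change to the TreeSet happens under the write lock.
    public void deregister(String name) {
        lock.writeLock().lock();
        try {
            sources.remove(name);
        } finally {
            lock.writeLock().unlock();
        }
    }

    public int size() {
        lock.readLock().lock();
        try {
            return sources.size();
        } finally {
            lock.readLock().unlock();
        }
    }

    // Simulates several handler threads closing their regions at the same time
    // and returns how many sources are still registered afterwards.
    public static int closeConcurrently(int threads, int regionsPerThread) {
        GuardedRegistry registry = new GuardedRegistry();
        for (int t = 0; t < threads; t++) {
            for (int r = 0; r < regionsPerThread; r++) {
                registry.register("region-" + t + "-" + r);
            }
        }
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        CountDownLatch start = new CountDownLatch(1);
        for (int t = 0; t < threads; t++) {
            final int id = t;
            pool.submit(() -> {
                try {
                    start.await(); // release all workers together
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                    return;
                }
                for (int r = 0; r < regionsPerThread; r++) {
                    registry.deregister("region-" + id + "-" + r);
                }
            });
        }
        start.countDown();
        pool.shutdown();
        try {
            if (!pool.awaitTermination(30, TimeUnit.SECONDS)) {
                throw new IllegalStateException("deregistration did not finish");
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted while waiting", e);
        }
        return registry.size();
    }

    public static void main(String[] args) {
        System.out.println("sources left: " + closeConcurrently(4, 1000));
    }
}
```

      With the lock in place, every worker finishes and the set ends up empty. Removing the locking from deregister makes the outcome undefined, since TreeSet documents that concurrent structural modification requires external synchronization.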
      
      

      Attachments

        1. TestStartStop.java
          2 kB
          Nicolas Liochon
        2. s1.txt
          172 kB
          Nicolas Liochon
        3. 7871-v4.txt
          4 kB
          Ted Yu
        4. 7871-v3.txt
          4 kB
          Ted Yu
        5. 7871-v2.patch
          1 kB
          Ted Yu
        6. 7871.patch
          0.7 kB
          Ted Yu


          People

            Assignee: yuzhihong@gmail.com Ted Yu
            Reporter: nkeywal Nicolas Liochon
            Votes: 0
            Watchers: 10
