Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-18346

TestRSKilledWhenInitializing failing on branch-1 (branch-1.4)

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.4.0, 1.5.0
    • Component/s: None
    • Labels:
      None

      Description

      Running org.apache.hadoop.hbase.regionserver.TestRSKilledWhenInitializing
      Tests run: 1, Failures: 1, Errors: 0, Skipped: 0, Time elapsed: 12.343 sec <<< FAILURE! - in org.apache.hadoop.hbase.regionserver.TestRSKilledWhenInitiali
      zing
      testRSTerminationAfterRegisteringToMasterBeforeCreatingEphemeralNode(org.apache.hadoop.hbase.regionserver.TestRSKilledWhenInitializing) Time elapsed: 12.
      329 sec <<< FAILURE!
      java.lang.AssertionError: null
      at org.junit.Assert.fail(Assert.java:86)
      at org.junit.Assert.assertTrue(Assert.java:41)
      at org.junit.Assert.assertTrue(Assert.java:52)
      at org.apache.hadoop.hbase.regionserver.TestRSKilledWhenInitializing.testRSTerminationAfterRegisteringToMasterBeforeCreatingEphemeralNode(TestRSKi
      lledWhenInitializing.java:123)

      Tests run: 2, Failures: 0, Errors: 2, Skipped: 0, Time elapsed: 192.427 sec <<< FAILURE! - in org.apache.hadoop.hbase.regionserver.TestRSKilledWhenInitial
      izing
      testRSTerminationAfterRegisteringToMasterBeforeCreatingEphemeralNode(org.apache.hadoop.hbase.regionserver.TestRSKilledWhenInitializing) Time elapsed: 179
      .859 sec <<< ERROR!
      org.junit.runners.model.TestTimedOutException: test timed out after 180 seconds
      at java.lang.Thread.sleep(Native Method)
      at org.apache.hadoop.hbase.util.Threads.sleep(Threads.java:146)
      at org.apache.hadoop.hbase.regionserver.TestRSKilledWhenInitializing.testRSTerminationAfterRegisteringToMasterBeforeCreatingEphemeralNode(TestRSKi
      lledWhenInitializing.java:113)

        Activity

        Hide
        apurtell Andrew Purtell added a comment -

        Of course, now I'm having a hard time reproducing this

        Show
        apurtell Andrew Purtell added a comment - Of course, now I'm having a hard time reproducing this
        Hide
        apurtell Andrew Purtell added a comment -

        The commit that destabilized this test is

        f45d26190274c1f91418bb2ab440a3f8546096de is the first bad commit
        commit f45d26190274c1f91418bb2ab440a3f8546096de
        Author: Phil Yang <yangzhe1991@apache.org>
        Date: Tue Mar 7 22:27:06 2017 +0800

        HBASE-15484 Correct the semantic of batch and partial - amend to fix bug and revise the JavaDoc for related APIs.

        Let me look at the test log for reason why

        Show
        apurtell Andrew Purtell added a comment - The commit that destabilized this test is f45d26190274c1f91418bb2ab440a3f8546096de is the first bad commit commit f45d26190274c1f91418bb2ab440a3f8546096de Author: Phil Yang <yangzhe1991@apache.org> Date: Tue Mar 7 22:27:06 2017 +0800 HBASE-15484 Correct the semantic of batch and partial - amend to fix bug and revise the JavaDoc for related APIs. Let me look at the test log for reason why
        Hide
        apurtell Andrew Purtell added a comment -

        Tail of the log is full of this

        java.lang.InterruptedException: sleep interrupted
                at java.lang.Thread.sleep(Native Method)
                at org.apache.hadoop.hbase.util.Threads.sleep(Threads.java:146)
                at org.apache.hadoop.hbase.regionserver.TestRSKilledWhenInitializing.testRSTerminationAfterRegisteringToMasterBeforeCreatingEphemeralNode
        (TestRSKilledWhenInitializing.java:115)
        
        Show
        apurtell Andrew Purtell added a comment - Tail of the log is full of this java.lang.InterruptedException: sleep interrupted at java.lang.Thread.sleep(Native Method) at org.apache.hadoop.hbase.util.Threads.sleep(Threads.java:146) at org.apache.hadoop.hbase.regionserver.TestRSKilledWhenInitializing.testRSTerminationAfterRegisteringToMasterBeforeCreatingEphemeralNode (TestRSKilledWhenInitializing.java:115)
        Hide
        apurtell Andrew Purtell added a comment -

        The RegisterAndDieRegionServer doesn't always go down.

        Show
        apurtell Andrew Purtell added a comment - The RegisterAndDieRegionServer doesn't always go down.
        Hide
        apurtell Andrew Purtell added a comment -

        Latest tests hang here:

        "Time-limited test" #905 daemon prio=5 os_prio=31 tid=0x00007fdfeb242000 nid=0x2de07 sleeping[0x00007000078d0000]
           java.lang.Thread.State: TIMED_WAITING (sleeping)
                at java.lang.Thread.sleep(Native Method)
                at org.apache.hadoop.hbase.util.Threads.sleep(Threads.java:146)
                at org.apache.hadoop.hbase.regionserver.TestRSKilledWhenInitializing.testRSTerminationAfterRegisteringToMasterBeforeCreatingEphemeralNode(TestRSKilledWhenInitializing.java:113)
                at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
                at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
                at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
                at java.lang.reflect.Method.invoke(Method.java:498)
                at org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:50)
                at org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
                at org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:47)
                at org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
                at org.junit.rules.TestWatcher$1.evaluate(TestWatcher.java:55)
                at org.junit.internal.runners.statements.FailOnTimeout$CallableStatement.call(FailOnTimeout.java:298)
                at org.junit.internal.runners.statements.FailOnTimeout$CallableStatement.call(FailOnTimeout.java:292)
                at java.util.concurrent.FutureTask.run(FutureTask.java:266)
                at java.lang.Thread.run(Thread.java:748)
        

        I'm thinking about temporarily disabling this test.

        Show
        apurtell Andrew Purtell added a comment - Latest tests hang here: "Time-limited test" #905 daemon prio=5 os_prio=31 tid=0x00007fdfeb242000 nid=0x2de07 sleeping[0x00007000078d0000] java.lang.Thread.State: TIMED_WAITING (sleeping) at java.lang.Thread.sleep(Native Method) at org.apache.hadoop.hbase.util.Threads.sleep(Threads.java:146) at org.apache.hadoop.hbase.regionserver.TestRSKilledWhenInitializing.testRSTerminationAfterRegisteringToMasterBeforeCreatingEphemeralNode(TestRSKilledWhenInitializing.java:113) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) at java.lang.reflect.Method.invoke(Method.java:498) at org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:50) at org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12) at org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:47) at org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17) at org.junit.rules.TestWatcher$1.evaluate(TestWatcher.java:55) at org.junit.internal.runners.statements.FailOnTimeout$CallableStatement.call(FailOnTimeout.java:298) at org.junit.internal.runners.statements.FailOnTimeout$CallableStatement.call(FailOnTimeout.java:292) at java.util.concurrent.FutureTask.run(FutureTask.java:266) at java.lang.Thread.run(Thread.java:748) I'm thinking about temporarily disabling this test.
        Hide
        apurtell Andrew Purtell added a comment -

        Disable test on branch-1 and branch-1.4

        Show
        apurtell Andrew Purtell added a comment - Disable test on branch-1 and branch-1.4
        Hide
        hudson Hudson added a comment -

        SUCCESS: Integrated in Jenkins build HBase-1.4 #913 (See https://builds.apache.org/job/HBase-1.4/913/)
        HBASE-18346 TestRSKilledWhenInitializing failing on branch-1 (apurtell: rev e985d9f15c79d7616117c0d9c0b8a45ab64f63b5)

        • (edit) hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/TestRSKilledWhenInitializing.java
        Show
        hudson Hudson added a comment - SUCCESS: Integrated in Jenkins build HBase-1.4 #913 (See https://builds.apache.org/job/HBase-1.4/913/ ) HBASE-18346 TestRSKilledWhenInitializing failing on branch-1 (apurtell: rev e985d9f15c79d7616117c0d9c0b8a45ab64f63b5) (edit) hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/TestRSKilledWhenInitializing.java
        Hide
        hudson Hudson added a comment -

        FAILURE: Integrated in Jenkins build HBase-1.5 #57 (See https://builds.apache.org/job/HBase-1.5/57/)
        HBASE-18346 TestRSKilledWhenInitializing failing on branch-1 (apurtell: rev 0621486620783830c766955c3d9f4feb878248e1)

        • (edit) hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/TestRSKilledWhenInitializing.java
        Show
        hudson Hudson added a comment - FAILURE: Integrated in Jenkins build HBase-1.5 #57 (See https://builds.apache.org/job/HBase-1.5/57/ ) HBASE-18346 TestRSKilledWhenInitializing failing on branch-1 (apurtell: rev 0621486620783830c766955c3d9f4feb878248e1) (edit) hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/TestRSKilledWhenInitializing.java

          People

          • Assignee:
            apurtell Andrew Purtell
            Reporter:
            apurtell Andrew Purtell
          • Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development