Details
-
Bug
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
-
None
Description
UpgradeTestOpenJDK8 Build 1299 times out executing lucene tests, although I'm not sure its a lucene issue.
Test results show 852 tests completed with no tests in org.apache.geode.cache.lucene package:
The geode-lucene/build/upgradeTest/upgradeTest-progress.txt file shows 108 tests completed successfully including:
20 RollingUpgradeReindexShouldBeSuccessfulWhenAllServersRollToCurrentVersion
20 RollingUpgradeQueryReturnsCorrectResultsAfterServersRollOverOnPersistentPartitionRegion
20 RollingUpgradeQueryReturnsCorrectResultAfterTwoLocatorsWithTwoServersAreRolled
20 RollingUpgradeQueryReturnsCorrectResultsAfterClientAndServersAreRolledOver
20 RollingUpgradeQueryReturnsCorrectResultsAfterServersRollOverOnPartitionRegion
8 RollingUpgradeQueryReturnsCorrectResultsAfterClientAndServersAreRestartedFromCurrentVersion
The builds before and after build 1299 show 980 total tests executed including 128 tests in org.apache.geode.cache.lucene package.
Build 1298:
Build 1300:
In addtition to the 108 tests above, there are:
20 RollingUpgradeQueryReturnsCorrectResultsAfterClientAndServersAreRolledOverAllBucketsCreated
So, this test didn't run at all in build 1299
All three callstacks for this run were stuck in the same place:
One thread was starting a locator in a member:
"Test worker" #25 prio=5 os_prio=0 tid=0x00007f4c90997000 nid=0x5e runnable [0x00007f4c0fffc000] java.lang.Thread.State: RUNNABLE at java.net.SocketInputStream.socketRead0(Native Method) at java.net.SocketInputStream.socketRead(SocketInputStream.java:116) at java.net.SocketInputStream.read(SocketInputStream.java:171) at java.net.SocketInputStream.read(SocketInputStream.java:141) at java.io.BufferedInputStream.fill(BufferedInputStream.java:246) at java.io.BufferedInputStream.read(BufferedInputStream.java:265) - locked <0x00000000fa0c7c00> (a java.io.BufferedInputStream) at java.io.DataInputStream.readByte(DataInputStream.java:265) at sun.rmi.transport.StreamRemoteCall.executeCall(StreamRemoteCall.java:222) at sun.rmi.server.UnicastRef.invoke(UnicastRef.java:161) at java.rmi.server.RemoteObjectInvocationHandler.invokeRemoteMethod(RemoteObjectInvocationHandler.java:227) at java.rmi.server.RemoteObjectInvocationHandler.invoke(RemoteObjectInvocationHandler.java:179) at com.sun.proxy.$Proxy57.executeMethodOnObject(Unknown Source) at org.apache.geode.test.dunit.internal.DUnitLauncher.startLocator(DUnitLauncher.java:293) at org.apache.geode.test.dunit.internal.DUnitLauncher.launch(DUnitLauncher.java:222) at org.apache.geode.test.dunit.internal.DUnitLauncher.launchIfNeeded(DUnitLauncher.java:160) at org.apache.geode.test.dunit.internal.JUnit4DistributedTestCase.initializeDistributedTestCase(JUnit4DistributedTestCase.java:103)
That member was stuck here:
"RMI TCP Connection(1)-172.17.0.4" #32 daemon prio=5 os_prio=0 tid=0x00007f5e74001800 nid=0xcf in Object.wait() [0x00007f5e897c8000] java.lang.Thread.State: RUNNABLE at org.apache.geode.distributed.internal.AbstractDistributionConfig.<clinit>(AbstractDistributionConfig.java:931) at org.apache.geode.distributed.internal.InternalLocator.<init>(InternalLocator.java:487) at org.apache.geode.distributed.internal.InternalLocator.createLocator(InternalLocator.java:289) - locked <0x00000000f9b6b8a8> (a java.lang.Object) at org.apache.geode.distributed.internal.InternalLocator.startLocator(InternalLocator.java:367) at org.apache.geode.distributed.internal.InternalLocator.startLocator(InternalLocator.java:330) at org.apache.geode.distributed.Locator.startLocator(Locator.java:252) at org.apache.geode.distributed.Locator.startLocatorAndDS(Locator.java:139) at org.apache.geode.test.dunit.internal.DUnitLauncher$1.call(DUnitLauncher.java:311)
So, it looks like that test hung right at the beginning attempting to start a locator.