Flink / FLINK-24357

ZooKeeperLeaderElectionConnectionHandlingTest#testLoseLeadershipOnLostConnectionIfTolerateSuspendedConnectionsIsEnabled fails with an Unhandled error


Details

    Description

      In a private Azure build, while testing my own PR, I noticed the following error. It looks unrelated to my changes (modifications to the error/cancellation handling logic of the Task class):

      2021-09-22T08:09:16.6244936Z Sep 22 08:09:16 [ERROR] testLoseLeadershipOnLostConnectionIfTolerateSuspendedConnectionsIsEnabled  Time elapsed: 28.753 s  <<< FAILURE!
      2021-09-22T08:09:16.6245821Z Sep 22 08:09:16 java.lang.AssertionError: The TestingFatalErrorHandler caught an exception.
      2021-09-22T08:09:16.6246513Z Sep 22 08:09:16 	at org.apache.flink.runtime.util.TestingFatalErrorHandlerResource.after(TestingFatalErrorHandlerResource.java:78)
      2021-09-22T08:09:16.6247281Z Sep 22 08:09:16 	at org.apache.flink.runtime.util.TestingFatalErrorHandlerResource.access$300(TestingFatalErrorHandlerResource.java:33)
      2021-09-22T08:09:16.6248167Z Sep 22 08:09:16 	at org.apache.flink.runtime.util.TestingFatalErrorHandlerResource$1.evaluate(TestingFatalErrorHandlerResource.java:57)
      2021-09-22T08:09:16.6248862Z Sep 22 08:09:16 	at org.junit.rules.ExternalResource$1.evaluate(ExternalResource.java:54)
      2021-09-22T08:09:16.6249620Z Sep 22 08:09:16 	at org.apache.flink.util.TestNameProvider$1.evaluate(TestNameProvider.java:45)
      2021-09-22T08:09:16.6250210Z Sep 22 08:09:16 	at org.junit.rules.TestWatcher$1.evaluate(TestWatcher.java:61)
      2021-09-22T08:09:16.6250773Z Sep 22 08:09:16 	at org.junit.runners.ParentRunner$3.evaluate(ParentRunner.java:306)
      2021-09-22T08:09:16.6251375Z Sep 22 08:09:16 	at org.junit.runners.BlockJUnit4ClassRunner$1.evaluate(BlockJUnit4ClassRunner.java:100)
      2021-09-22T08:09:16.6251951Z Sep 22 08:09:16 	at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:366)
      2021-09-22T08:09:16.6252562Z Sep 22 08:09:16 	at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:103)
      2021-09-22T08:09:16.6253415Z Sep 22 08:09:16 	at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:63)
      2021-09-22T08:09:16.6254469Z Sep 22 08:09:16 	at org.junit.runners.ParentRunner$4.run(ParentRunner.java:331)
      2021-09-22T08:09:16.6255039Z Sep 22 08:09:16 	at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:79)
      2021-09-22T08:09:16.6256238Z Sep 22 08:09:16 	at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:329)
      2021-09-22T08:09:16.6257109Z Sep 22 08:09:16 	at org.junit.runners.ParentRunner.access$100(ParentRunner.java:66)
      2021-09-22T08:09:16.6257766Z Sep 22 08:09:16 	at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:293)
      2021-09-22T08:09:16.6258406Z Sep 22 08:09:16 	at org.junit.runners.ParentRunner$3.evaluate(ParentRunner.java:306)
      2021-09-22T08:09:16.6259050Z Sep 22 08:09:16 	at org.junit.runners.ParentRunner.run(ParentRunner.java:413)
      2021-09-22T08:09:16.6259827Z Sep 22 08:09:16 	at org.junit.runner.JUnitCore.run(JUnitCore.java:137)
      2021-09-22T08:09:16.6260963Z Sep 22 08:09:16 	at org.junit.runner.JUnitCore.run(JUnitCore.java:115)
      2021-09-22T08:09:16.6261796Z Sep 22 08:09:16 	at org.junit.vintage.engine.execution.RunnerExecutor.execute(RunnerExecutor.java:43)
      2021-09-22T08:09:16.6262428Z Sep 22 08:09:16 	at java.util.stream.ForEachOps$ForEachOp$OfRef.accept(ForEachOps.java:183)
      2021-09-22T08:09:16.6263268Z Sep 22 08:09:16 	at java.util.stream.ReferencePipeline$3$1.accept(ReferencePipeline.java:193)
      2021-09-22T08:09:16.6263875Z Sep 22 08:09:16 	at java.util.Iterator.forEachRemaining(Iterator.java:116)
      2021-09-22T08:09:16.6265025Z Sep 22 08:09:16 	at java.util.Spliterators$IteratorSpliterator.forEachRemaining(Spliterators.java:1801)
      2021-09-22T08:09:16.6265940Z Sep 22 08:09:16 	at java.util.stream.AbstractPipeline.copyInto(AbstractPipeline.java:482)
      2021-09-22T08:09:16.6266767Z Sep 22 08:09:16 	at java.util.stream.AbstractPipeline.wrapAndCopyInto(AbstractPipeline.java:472)
      2021-09-22T08:09:16.6267470Z Sep 22 08:09:16 	at java.util.stream.ForEachOps$ForEachOp.evaluateSequential(ForEachOps.java:150)
      2021-09-22T08:09:16.6268165Z Sep 22 08:09:16 	at java.util.stream.ForEachOps$ForEachOp$OfRef.evaluateSequential(ForEachOps.java:173)
      2021-09-22T08:09:16.6269341Z Sep 22 08:09:16 	at java.util.stream.AbstractPipeline.evaluate(AbstractPipeline.java:234)
      2021-09-22T08:09:16.6269928Z Sep 22 08:09:16 	at java.util.stream.ReferencePipeline.forEach(ReferencePipeline.java:485)
      2021-09-22T08:09:16.6270951Z Sep 22 08:09:16 	at org.junit.vintage.engine.VintageTestEngine.executeAllChildren(VintageTestEngine.java:82)
      2021-09-22T08:09:16.6271683Z Sep 22 08:09:16 	at org.junit.vintage.engine.VintageTestEngine.execute(VintageTestEngine.java:73)
      2021-09-22T08:09:16.6274483Z Sep 22 08:09:16 	at org.junit.platform.launcher.core.DefaultLauncher.execute(DefaultLauncher.java:220)
      2021-09-22T08:09:16.6275060Z Sep 22 08:09:16 	at org.junit.platform.launcher.core.DefaultLauncher.lambda$execute$6(DefaultLauncher.java:188)
      2021-09-22T08:09:16.6275819Z Sep 22 08:09:16 	at org.junit.platform.launcher.core.DefaultLauncher.withInterceptedStreams(DefaultLauncher.java:202)
      2021-09-22T08:09:16.6276557Z Sep 22 08:09:16 	at org.junit.platform.launcher.core.DefaultLauncher.execute(DefaultLauncher.java:181)
      2021-09-22T08:09:16.6277234Z Sep 22 08:09:16 	at org.junit.platform.launcher.core.DefaultLauncher.execute(DefaultLauncher.java:128)
      2021-09-22T08:09:16.6277770Z Sep 22 08:09:16 	at org.apache.maven.surefire.junitplatform.JUnitPlatformProvider.invokeAllTests(JUnitPlatformProvider.java:150)
      2021-09-22T08:09:16.6278345Z Sep 22 08:09:16 	at org.apache.maven.surefire.junitplatform.JUnitPlatformProvider.invoke(JUnitPlatformProvider.java:120)
      2021-09-22T08:09:16.6278909Z Sep 22 08:09:16 	at org.apache.maven.surefire.booter.ForkedBooter.invokeProviderInSameClassLoader(ForkedBooter.java:384)
      2021-09-22T08:09:16.6279433Z Sep 22 08:09:16 	at org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:345)
      2021-09-22T08:09:16.6279939Z Sep 22 08:09:16 	at org.apache.maven.surefire.booter.ForkedBooter.execute(ForkedBooter.java:126)
      2021-09-22T08:09:16.6280627Z Sep 22 08:09:16 	at org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:418)
      2021-09-22T08:09:16.6281222Z Sep 22 08:09:16 Caused by: org.apache.flink.runtime.leaderelection.LeaderElectionException: Unhandled error in ZooKeeperLeaderElectionDriver: Ensure path threw exception
      2021-09-22T08:09:16.6281884Z Sep 22 08:09:16 	at org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionDriver.unhandledError(ZooKeeperLeaderElectionDriver.java:295)
      2021-09-22T08:09:16.6282527Z Sep 22 08:09:16 	at org.apache.flink.shaded.curator4.org.apache.curator.framework.imps.CuratorFrameworkImpl$6.apply(CuratorFrameworkImpl.java:713)
      2021-09-22T08:09:16.6283172Z Sep 22 08:09:16 	at org.apache.flink.shaded.curator4.org.apache.curator.framework.imps.CuratorFrameworkImpl$6.apply(CuratorFrameworkImpl.java:709)
      2021-09-22T08:09:16.6283803Z Sep 22 08:09:16 	at org.apache.flink.shaded.curator4.org.apache.curator.framework.listen.ListenerContainer$1.run(ListenerContainer.java:100)
      2021-09-22T08:09:16.6284454Z Sep 22 08:09:16 	at org.apache.flink.shaded.curator4.org.apache.curator.shaded.com.google.common.util.concurrent.DirectExecutor.execute(DirectExecutor.java:30)
      2021-09-22T08:09:16.6285302Z Sep 22 08:09:16 	at org.apache.flink.shaded.curator4.org.apache.curator.framework.listen.ListenerContainer.forEach(ListenerContainer.java:92)
      2021-09-22T08:09:16.6286162Z Sep 22 08:09:16 	at org.apache.flink.shaded.curator4.org.apache.curator.framework.imps.CuratorFrameworkImpl.logError(CuratorFrameworkImpl.java:708)
      2021-09-22T08:09:16.6286836Z Sep 22 08:09:16 	at org.apache.flink.shaded.curator4.org.apache.curator.framework.imps.NamespaceImpl.fixForNamespace(NamespaceImpl.java:100)
      2021-09-22T08:09:16.6287505Z Sep 22 08:09:16 	at org.apache.flink.shaded.curator4.org.apache.curator.framework.imps.CuratorFrameworkImpl.fixForNamespace(CuratorFrameworkImpl.java:731)
      2021-09-22T08:09:16.6288396Z Sep 22 08:09:16 	at org.apache.flink.shaded.curator4.org.apache.curator.framework.imps.WatcherRemovalFacade.fixForNamespace(WatcherRemovalFacade.java:170)
      2021-09-22T08:09:16.6289236Z Sep 22 08:09:16 	at org.apache.flink.shaded.curator4.org.apache.curator.framework.imps.GetDataBuilderImpl.forPath(GetDataBuilderImpl.java:295)
      2021-09-22T08:09:16.6289862Z Sep 22 08:09:16 	at org.apache.flink.shaded.curator4.org.apache.curator.framework.imps.GetDataBuilderImpl.forPath(GetDataBuilderImpl.java:35)
      2021-09-22T08:09:16.6290487Z Sep 22 08:09:16 	at org.apache.flink.shaded.curator4.org.apache.curator.framework.recipes.cache.TreeCache$TreeNode.doRefreshData(TreeCache.java:287)
      2021-09-22T08:09:16.6291319Z Sep 22 08:09:16 	at org.apache.flink.shaded.curator4.org.apache.curator.framework.recipes.cache.TreeCache$TreeNode.refreshData(TreeCache.java:266)
      2021-09-22T08:09:16.6291975Z Sep 22 08:09:16 	at org.apache.flink.shaded.curator4.org.apache.curator.framework.recipes.cache.TreeCache$TreeNode.refresh(TreeCache.java:250)
      2021-09-22T08:09:16.6292615Z Sep 22 08:09:16 	at org.apache.flink.shaded.curator4.org.apache.curator.framework.recipes.cache.TreeCache$TreeNode.wasCreated(TreeCache.java:316)
      2021-09-22T08:09:16.6293277Z Sep 22 08:09:16 	at org.apache.flink.shaded.curator4.org.apache.curator.framework.recipes.cache.TreeCache.handleStateChange(TreeCache.java:819)
      2021-09-22T08:09:16.6293918Z Sep 22 08:09:16 	at org.apache.flink.shaded.curator4.org.apache.curator.framework.recipes.cache.TreeCache.access$1800(TreeCache.java:75)
      2021-09-22T08:09:16.6294555Z Sep 22 08:09:16 	at org.apache.flink.shaded.curator4.org.apache.curator.framework.recipes.cache.TreeCache$1.stateChanged(TreeCache.java:543)
      2021-09-22T08:09:16.6295373Z Sep 22 08:09:16 	at org.apache.flink.shaded.curator4.org.apache.curator.framework.state.ConnectionStateManager.lambda$processEvents$1(ConnectionStateManager.java:280)
      2021-09-22T08:09:16.6296073Z Sep 22 08:09:16 	at org.apache.flink.shaded.curator4.org.apache.curator.framework.listen.MappingListenerManager.lambda$forEach$0(MappingListenerManager.java:93)
      2021-09-22T08:09:16.6296829Z Sep 22 08:09:16 	at org.apache.flink.shaded.curator4.org.apache.curator.framework.listen.MappingListenerManager.forEach(MappingListenerManager.java:90)
      2021-09-22T08:09:16.6297559Z Sep 22 08:09:16 	at org.apache.flink.shaded.curator4.org.apache.curator.framework.listen.StandardListenerManager.forEach(StandardListenerManager.java:89)
      2021-09-22T08:09:16.6298402Z Sep 22 08:09:16 	at org.apache.flink.shaded.curator4.org.apache.curator.framework.state.ConnectionStateManager.processEvents(ConnectionStateManager.java:280)
      2021-09-22T08:09:16.6299099Z Sep 22 08:09:16 	at org.apache.flink.shaded.curator4.org.apache.curator.framework.state.ConnectionStateManager.access$000(ConnectionStateManager.java:44)
      2021-09-22T08:09:16.6300215Z Sep 22 08:09:16 	at org.apache.flink.shaded.curator4.org.apache.curator.framework.state.ConnectionStateManager$1.call(ConnectionStateManager.java:133)
      2021-09-22T08:09:16.6318237Z Sep 22 08:09:16 	at java.util.concurrent.FutureTask.run(FutureTask.java:266)
      2021-09-22T08:09:16.6318982Z Sep 22 08:09:16 	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
      2021-09-22T08:09:16.6323311Z Sep 22 08:09:16 	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
      2021-09-22T08:09:16.6326943Z Sep 22 08:09:16 	at java.lang.Thread.run(Thread.java:748)
      2021-09-22T08:09:16.6327544Z Sep 22 08:09:16 Caused by: org.apache.flink.shaded.zookeeper3.org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode = ConnectionLoss for /flink
      2021-09-22T08:09:16.6328187Z Sep 22 08:09:16 	at org.apache.flink.shaded.zookeeper3.org.apache.zookeeper.KeeperException.create(KeeperException.java:102)
      2021-09-22T08:09:16.6328785Z Sep 22 08:09:16 	at org.apache.flink.shaded.zookeeper3.org.apache.zookeeper.KeeperException.create(KeeperException.java:54)
      2021-09-22T08:09:16.6329526Z Sep 22 08:09:16 	at org.apache.flink.shaded.zookeeper3.org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1111)
      2021-09-22T08:09:16.6330053Z Sep 22 08:09:16 	at org.apache.flink.shaded.zookeeper3.org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1139)
      2021-09-22T08:09:16.6330609Z Sep 22 08:09:16 	at org.apache.flink.shaded.curator4.org.apache.curator.utils.ZKPaths.mkdirs(ZKPaths.java:291)
      2021-09-22T08:09:16.6331177Z Sep 22 08:09:16 	at org.apache.flink.shaded.curator4.org.apache.curator.framework.imps.NamespaceImpl$1.call(NamespaceImpl.java:90)
      2021-09-22T08:09:16.6331842Z Sep 22 08:09:16 	at org.apache.flink.shaded.curator4.org.apache.curator.connection.StandardConnectionHandlingPolicy.callWithRetry(StandardConnectionHandlingPolicy.java:64)
      2021-09-22T08:09:16.6332463Z Sep 22 08:09:16 	at org.apache.flink.shaded.curator4.org.apache.curator.RetryLoop.callWithRetry(RetryLoop.java:100)
      2021-09-22T08:09:16.6333051Z Sep 22 08:09:16 	at org.apache.flink.shaded.curator4.org.apache.curator.framework.imps.NamespaceImpl.fixForNamespace(NamespaceImpl.java:83)
      2021-09-22T08:09:16.6333499Z Sep 22 08:09:16 	... 22 more
      
      2021-09-22T08:29:57.2073586Z Sep 22 08:29:57 [ERROR] Failures: 
      2021-09-22T08:29:57.2074275Z Sep 22 08:29:57 [ERROR]   ZooKeeperLeaderElectionConnectionHandlingTest.testLoseLeadershipOnLostConnectionIfTolerateSuspendedConnectionsIsEnabled The TestingFatalErrorHandler caught an exception.
      

          People

            trohrmann Till Rohrmann
            pnowojski Piotr Nowojski