Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-27075 Migrate CI from Azure to Github Actions
  3. FLINK-33464

JoinITCase.testRightOuterJoin failed due to heartbeat timeout

    XMLWordPrintableJSON

Details

    Description

      https://github.com/XComp/flink/actions/runs/6756936036/job/18367079822#step:12:11525

      Error: 21:46:20 21:46:20.936 [ERROR] Tests run: 196, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 788.192 s <<< FAILURE! - in org.apache.flink.table.planner.runtime.batch.sql.join.JoinITCase
      Error: 21:46:20 21:46:20.936 [ERROR] org.apache.flink.table.planner.runtime.batch.sql.join.JoinITCase.testRightOuterJoin  Time elapsed: 68.118 s  <<< ERROR!
      Nov 04 21:46:20 java.lang.RuntimeException: Failed to fetch next result
      Nov 04 21:46:20 	at org.apache.flink.streaming.api.operators.collect.CollectResultIterator.nextResultFromFetcher(CollectResultIterator.java:118)
      Nov 04 21:46:20 	at org.apache.flink.streaming.api.operators.collect.CollectResultIterator.hasNext(CollectResultIterator.java:89)
      Nov 04 21:46:20 	at org.apache.flink.table.planner.connectors.CollectDynamicSink$CloseableRowIteratorWrapper.hasNext(CollectDynamicSink.java:230)
      Nov 04 21:46:20 	at java.base/java.util.Iterator.forEachRemaining(Iterator.java:132)
      Nov 04 21:46:20 	at org.apache.flink.util.CollectionUtil.iteratorToList(CollectionUtil.java:122)
      Nov 04 21:46:20 	at org.apache.flink.table.planner.runtime.utils.BatchTestBase.executeQuery(BatchTestBase.scala:309)
      Nov 04 21:46:20 	at org.apache.flink.table.planner.runtime.utils.BatchTestBase.check(BatchTestBase.scala:145)
      Nov 04 21:46:20 	at org.apache.flink.table.planner.runtime.utils.BatchTestBase.checkResult(BatchTestBase.scala:109)
      Nov 04 21:46:20 	at org.apache.flink.table.planner.runtime.batch.sql.join.JoinITCase.testRightOuterJoin(JoinITCase.scala:892)
      Nov 04 21:46:20 	at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
      [...]
      Nov 04 21:46:20 	at java.base/java.util.concurrent.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:183)
      Nov 04 21:46:20 Caused by: java.io.IOException: Failed to fetch job execution result
      Nov 04 21:46:20 	at org.apache.flink.streaming.api.operators.collect.CollectResultFetcher.getAccumulatorResults(CollectResultFetcher.java:187)
      Nov 04 21:46:20 	at org.apache.flink.streaming.api.operators.collect.CollectResultFetcher.next(CollectResultFetcher.java:123)
      Nov 04 21:46:20 	at org.apache.flink.streaming.api.operators.collect.CollectResultIterator.nextResultFromFetcher(CollectResultIterator.java:115)
      Nov 04 21:46:20 	... 105 more
      Nov 04 21:46:20 Caused by: java.util.concurrent.ExecutionException: org.apache.flink.runtime.client.JobExecutionException: Job execution failed.
      Nov 04 21:46:20 	at java.base/java.util.concurrent.CompletableFuture.reportGet(CompletableFuture.java:395)
      Nov 04 21:46:20 	at java.base/java.util.concurrent.CompletableFuture.get(CompletableFuture.java:2022)
      Nov 04 21:46:20 	at org.apache.flink.streaming.api.operators.collect.CollectResultFetcher.getAccumulatorResults(CollectResultFetcher.java:185)
      Nov 04 21:46:20 	... 107 more
      Nov 04 21:46:20 Caused by: org.apache.flink.runtime.client.JobExecutionException: Job execution failed.
      Nov 04 21:46:20 	at org.apache.flink.runtime.jobmaster.JobResult.toJobExecutionResult(JobResult.java:144)
      Nov 04 21:46:20 	at org.apache.flink.runtime.minicluster.MiniClusterJobClient.lambda$getJobExecutionResult$3(MiniClusterJobClient.java:141)
      Nov 04 21:46:20 	at java.base/java.util.concurrent.CompletableFuture$UniApply.tryFire(CompletableFuture.java:642)
      Nov 04 21:46:20 	at java.base/java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:506)
      Nov 04 21:46:20 	at java.base/java.util.concurrent.CompletableFuture.complete(CompletableFuture.java:2073)
      Nov 04 21:46:20 	at org.apache.flink.runtime.rpc.pekko.PekkoInvocationHandler.lambda$invokeRpc$1(PekkoInvocationHandler.java:268)
      Nov 04 21:46:20 	at java.base/java.util.concurrent.CompletableFuture.uniWhenComplete(CompletableFuture.java:859)
      Nov 04 21:46:20 	at java.base/java.util.concurrent.CompletableFuture$UniWhenComplete.tryFire(CompletableFuture.java:837)
      Nov 04 21:46:20 	at java.base/java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:506)
      Nov 04 21:46:20 	at java.base/java.util.concurrent.CompletableFuture.complete(CompletableFuture.java:2073)
      Nov 04 21:46:20 	at org.apache.flink.util.concurrent.FutureUtils.doForward(FutureUtils.java:1287)
      Nov 04 21:46:20 	at org.apache.flink.runtime.concurrent.ClassLoadingUtils.lambda$guardCompletionWithContextClassLoader$1(ClassLoadingUtils.java:93)
      Nov 04 21:46:20 	at org.apache.flink.runtime.concurrent.ClassLoadingUtils.runWithContextClassLoader(ClassLoadingUtils.java:68)
      Nov 04 21:46:20 	at org.apache.flink.runtime.concurrent.ClassLoadingUtils.lambda$guardCompletionWithContextClassLoader$2(ClassLoadingUtils.java:92)
      Nov 04 21:46:20 	at java.base/java.util.concurrent.CompletableFuture.uniWhenComplete(CompletableFuture.java:859)
      Nov 04 21:46:20 	at java.base/java.util.concurrent.CompletableFuture$UniWhenComplete.tryFire(CompletableFuture.java:837)
      Nov 04 21:46:20 	at java.base/java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:506)
      Nov 04 21:46:20 	at java.base/java.util.concurrent.CompletableFuture.complete(CompletableFuture.java:2073)
      Nov 04 21:46:20 	at org.apache.flink.runtime.concurrent.pekko.ScalaFutureUtils$1.onComplete(ScalaFutureUtils.java:47)
      Nov 04 21:46:20 	at org.apache.pekko.dispatch.OnComplete.internal(Future.scala:310)
      Nov 04 21:46:20 	at org.apache.pekko.dispatch.OnComplete.internal(Future.scala:307)
      Nov 04 21:46:20 	at org.apache.pekko.dispatch.japi$CallbackBridge.apply(Future.scala:234)
      Nov 04 21:46:20 	at org.apache.pekko.dispatch.japi$CallbackBridge.apply(Future.scala:231)
      Nov 04 21:46:20 	at scala.concurrent.impl.CallbackRunnable.run(Promise.scala:64)
      Nov 04 21:46:20 	at org.apache.flink.runtime.concurrent.pekko.ScalaFutureUtils$DirectExecutionContext.execute(ScalaFutureUtils.java:65)
      Nov 04 21:46:20 	at scala.concurrent.impl.CallbackRunnable.executeWithValue(Promise.scala:72)
      Nov 04 21:46:20 	at scala.concurrent.impl.Promise$DefaultPromise.$anonfun$tryComplete$1(Promise.scala:288)
      Nov 04 21:46:20 	at scala.concurrent.impl.Promise$DefaultPromise.$anonfun$tryComplete$1$adapted(Promise.scala:288)
      Nov 04 21:46:20 	at scala.concurrent.impl.Promise$DefaultPromise.tryComplete(Promise.scala:288)
      Nov 04 21:46:20 	at org.apache.pekko.pattern.PromiseActorRef.$bang(AskSupport.scala:629)
      Nov 04 21:46:20 	at org.apache.pekko.pattern.PipeToSupport$PipeableFuture$$anonfun$pipeTo$1.applyOrElse(PipeToSupport.scala:34)
      Nov 04 21:46:20 	at org.apache.pekko.pattern.PipeToSupport$PipeableFuture$$anonfun$pipeTo$1.applyOrElse(PipeToSupport.scala:33)
      Nov 04 21:46:20 	at scala.concurrent.Future.$anonfun$andThen$1(Future.scala:536)
      Nov 04 21:46:20 	at scala.concurrent.impl.Promise.liftedTree1$1(Promise.scala:33)
      Nov 04 21:46:20 	at scala.concurrent.impl.Promise.$anonfun$transform$1(Promise.scala:33)
      Nov 04 21:46:20 	at scala.concurrent.impl.CallbackRunnable.run(Promise.scala:64)
      Nov 04 21:46:20 	at org.apache.pekko.dispatch.BatchingExecutor$AbstractBatch.processBatch(BatchingExecutor.scala:73)
      Nov 04 21:46:20 	at org.apache.pekko.dispatch.BatchingExecutor$BlockableBatch.$anonfun$run$1(BatchingExecutor.scala:110)
      Nov 04 21:46:20 	at scala.runtime.java8.JFunction0$mcV$sp.apply(JFunction0$mcV$sp.java:23)
      Nov 04 21:46:20 	at scala.concurrent.BlockContext$.withBlockContext(BlockContext.scala:85)
      Nov 04 21:46:20 	at org.apache.pekko.dispatch.BatchingExecutor$BlockableBatch.run(BatchingExecutor.scala:110)
      Nov 04 21:46:20 	at org.apache.pekko.dispatch.TaskInvocation.run(AbstractDispatcher.scala:59)
      Nov 04 21:46:20 	at org.apache.pekko.dispatch.ForkJoinExecutorConfigurator$PekkoForkJoinTask.exec(ForkJoinExecutorConfigurator.scala:57)
      Nov 04 21:46:20 	... 5 more
      Nov 04 21:46:20 Caused by: org.apache.flink.runtime.JobException: Recovery is suppressed by NoRestartBackoffTimeStrategy
      Nov 04 21:46:20 	at org.apache.flink.runtime.executiongraph.failover.flip1.ExecutionFailureHandler.handleFailure(ExecutionFailureHandler.java:176)
      Nov 04 21:46:20 	at org.apache.flink.runtime.executiongraph.failover.flip1.ExecutionFailureHandler.getFailureHandlingResult(ExecutionFailureHandler.java:107)
      Nov 04 21:46:20 	at org.apache.flink.runtime.scheduler.DefaultScheduler.recordTaskFailure(DefaultScheduler.java:285)
      Nov 04 21:46:20 	at org.apache.flink.runtime.scheduler.DefaultScheduler.handleTaskFailure(DefaultScheduler.java:276)
      Nov 04 21:46:20 	at org.apache.flink.runtime.scheduler.DefaultScheduler.onTaskFailed(DefaultScheduler.java:269)
      Nov 04 21:46:20 	at org.apache.flink.runtime.scheduler.SchedulerBase.onTaskExecutionStateUpdate(SchedulerBase.java:765)
      Nov 04 21:46:20 	at org.apache.flink.runtime.scheduler.SchedulerBase.updateTaskExecutionState(SchedulerBase.java:742)
      Nov 04 21:46:20 	at org.apache.flink.runtime.scheduler.UpdateSchedulerNgOnInternalFailuresListener.notifyTaskFailure(UpdateSchedulerNgOnInternalFailuresListener.java:51)
      Nov 04 21:46:20 	at org.apache.flink.runtime.executiongraph.DefaultExecutionGraph.notifySchedulerNgAboutInternalTaskFailure(DefaultExecutionGraph.java:1645)
      Nov 04 21:46:20 	at org.apache.flink.runtime.executiongraph.Execution.processFail(Execution.java:1144)
      Nov 04 21:46:20 	at org.apache.flink.runtime.executiongraph.Execution.processFail(Execution.java:1084)
      Nov 04 21:46:20 	at org.apache.flink.runtime.executiongraph.Execution.fail(Execution.java:785)
      Nov 04 21:46:20 	at org.apache.flink.runtime.jobmaster.slotpool.SingleLogicalSlot.signalPayloadRelease(SingleLogicalSlot.java:195)
      Nov 04 21:46:20 	at org.apache.flink.runtime.jobmaster.slotpool.SingleLogicalSlot.release(SingleLogicalSlot.java:182)
      Nov 04 21:46:20 	at org.apache.flink.runtime.scheduler.SimpleExecutionSlotAllocator$LogicalSlotHolder.release(SimpleExecutionSlotAllocator.java:203)
      Nov 04 21:46:20 	at org.apache.flink.runtime.jobmaster.slotpool.AllocatedSlot.releasePayload(AllocatedSlot.java:152)
      Nov 04 21:46:20 	at org.apache.flink.runtime.jobmaster.slotpool.DefaultDeclarativeSlotPool.releasePayload(DefaultDeclarativeSlotPool.java:482)
      Nov 04 21:46:20 	at org.apache.flink.runtime.jobmaster.slotpool.DefaultDeclarativeSlotPool.freeAndReleaseSlots(DefaultDeclarativeSlotPool.java:474)
      Nov 04 21:46:20 	at org.apache.flink.runtime.jobmaster.slotpool.DefaultDeclarativeSlotPool.releaseSlots(DefaultDeclarativeSlotPool.java:445)
      Nov 04 21:46:20 	at org.apache.flink.runtime.jobmaster.slotpool.DeclarativeSlotPoolService.internalReleaseTaskManager(DeclarativeSlotPoolService.java:275)
      Nov 04 21:46:20 	at org.apache.flink.runtime.jobmaster.slotpool.DeclarativeSlotPoolService.releaseTaskManager(DeclarativeSlotPoolService.java:231)
      Nov 04 21:46:20 	at org.apache.flink.runtime.jobmaster.JobMaster.disconnectTaskManager(JobMaster.java:549)
      Nov 04 21:46:20 	at org.apache.flink.runtime.jobmaster.JobMaster$TaskManagerHeartbeatListener.handleTaskManagerConnectionLoss(JobMaster.java:1469)
      Nov 04 21:46:20 	at org.apache.flink.runtime.jobmaster.JobMaster$TaskManagerHeartbeatListener.notifyHeartbeatTimeout(JobMaster.java:1464)
      Nov 04 21:46:20 	at org.apache.flink.runtime.heartbeat.DefaultHeartbeatMonitor.run(DefaultHeartbeatMonitor.java:158)
      Nov 04 21:46:20 	at java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515)
      Nov 04 21:46:20 	at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
      Nov 04 21:46:20 	at org.apache.flink.runtime.rpc.pekko.PekkoRpcActor.lambda$handleRunAsync$4(PekkoRpcActor.java:451)
      Nov 04 21:46:20 	at org.apache.flink.runtime.concurrent.ClassLoadingUtils.runWithContextClassLoader(ClassLoadingUtils.java:68)
      Nov 04 21:46:20 	at org.apache.flink.runtime.rpc.pekko.PekkoRpcActor.handleRunAsync(PekkoRpcActor.java:451)
      Nov 04 21:46:20 	at org.apache.flink.runtime.rpc.pekko.PekkoRpcActor.handleRpcMessage(PekkoRpcActor.java:218)
      Nov 04 21:46:20 	at org.apache.flink.runtime.rpc.pekko.FencedPekkoRpcActor.handleRpcMessage(FencedPekkoRpcActor.java:85)
      Nov 04 21:46:20 	at org.apache.flink.runtime.rpc.pekko.PekkoRpcActor.handleMessage(PekkoRpcActor.java:168)
      Nov 04 21:46:20 	at org.apache.pekko.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:33)
      Nov 04 21:46:20 	at org.apache.pekko.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:29)
      Nov 04 21:46:20 	at scala.PartialFunction.applyOrElse(PartialFunction.scala:127)
      Nov 04 21:46:20 	at scala.PartialFunction.applyOrElse$(PartialFunction.scala:126)
      Nov 04 21:46:20 	at org.apache.pekko.japi.pf.UnitCaseStatement.applyOrElse(CaseStatements.scala:29)
      Nov 04 21:46:20 	at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:175)
      Nov 04 21:46:20 	at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:176)
      Nov 04 21:46:20 	at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:176)
      Nov 04 21:46:20 	at org.apache.pekko.actor.Actor.aroundReceive(Actor.scala:547)
      Nov 04 21:46:20 	at org.apache.pekko.actor.Actor.aroundReceive$(Actor.scala:545)
      Nov 04 21:46:20 	at org.apache.pekko.actor.AbstractActor.aroundReceive(AbstractActor.scala:229)
      Nov 04 21:46:20 	at org.apache.pekko.actor.ActorCell.receiveMessage(ActorCell.scala:590)
      Nov 04 21:46:20 	at org.apache.pekko.actor.ActorCell.invoke(ActorCell.scala:557)
      Nov 04 21:46:20 	at org.apache.pekko.dispatch.Mailbox.processMailbox(Mailbox.scala:280)
      Nov 04 21:46:20 	at org.apache.pekko.dispatch.Mailbox.run(Mailbox.scala:241)
      Nov 04 21:46:20 	at org.apache.pekko.dispatch.Mailbox.exec(Mailbox.scala:253)
      Nov 04 21:46:20 	... 5 more
      Nov 04 21:46:20 Caused by: java.util.concurrent.TimeoutException: Heartbeat of TaskManager with id 0b0b8b9b-fa3b-4ce7-bd82-0bfdaf85ac79 timed out.
      Nov 04 21:46:20 	... 31 more
      

      Attachments

        Activity

          People

            Unassigned Unassigned
            mapohl Matthias Pohl
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated: