Flink / FLINK-26558

Errors when reporting on Job status

Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Duplicate
    • Affects Version/s: 1.14.3
    • Fix Version/s: None
    • Component/s: Runtime / REST
    • Labels: None

    Description

      This error has been showing up very frequently in my JobManager logs since I upgraded from 1.14.2 to 1.14.3. When it happens, the Flink REST dashboard also fails to load either Running or Completed jobs.

      The JobManagers run in HA mode on Kubernetes. The TaskManagers also run on Kubernetes and were upgraded to 1.14.3 as well.

      Please advise.

      2022-03-09 22:12:40,925 ERROR org.apache.flink.runtime.rest.handler.job.JobsOverviewHandler [] - Unhandled exception.
      org.apache.flink.runtime.rpc.akka.exceptions.AkkaRpcException: Failed to serialize the result for RPC call : requestMultipleJobDetails.
      at org.apache.flink.runtime.rpc.akka.AkkaRpcActor.serializeRemoteResultAndVerifySize(AkkaRpcActor.java:417) ~[?:?]
      at org.apache.flink.runtime.rpc.akka.AkkaRpcActor.lambda$sendAsyncResponse$2(AkkaRpcActor.java:373) ~[?:?]
      at java.util.concurrent.CompletableFuture.uniHandle(Unknown Source) ~[?:?]
      at java.util.concurrent.CompletableFuture$UniHandle.tryFire(Unknown Source) ~[?:?]
      at java.util.concurrent.CompletableFuture.postComplete(Unknown Source) ~[?:?]
      at java.util.concurrent.CompletableFuture.complete(Unknown Source) ~[?:?]
      at org.apache.flink.util.concurrent.FutureUtils$ResultConjunctFuture.handleCompletedFuture(FutureUtils.java:858) ~[flink-dist_2.11-1.14.3.jar:1.14.3]
      at org.apache.flink.util.concurrent.FutureUtils$ResultConjunctFuture.lambda$new$0(FutureUtils.java:876) ~[flink-dist_2.11-1.14.3.jar:1.14.3]
      at java.util.concurrent.CompletableFuture.uniWhenComplete(Unknown Source) ~[?:?]
      at java.util.concurrent.CompletableFuture$UniWhenComplete.tryFire(Unknown Source) ~[?:?]
      at java.util.concurrent.CompletableFuture.postComplete(Unknown Source) ~[?:?]
      at java.util.concurrent.CompletableFuture.complete(Unknown Source) ~[?:?]
      at org.apache.flink.runtime.rpc.akka.AkkaInvocationHandler.lambda$invokeRpc$1(AkkaInvocationHandler.java:258) ~[?:?]
      at java.util.concurrent.CompletableFuture.uniWhenComplete(Unknown Source) ~[?:?]
      at java.util.concurrent.CompletableFuture$UniWhenComplete.tryFire(Unknown Source) ~[?:?]
      at java.util.concurrent.CompletableFuture.postComplete(Unknown Source) ~[?:?]
      at java.util.concurrent.CompletableFuture.complete(Unknown Source) ~[?:?]
      at org.apache.flink.util.concurrent.FutureUtils.doForward(FutureUtils.java:1389) ~[flink-dist_2.11-1.14.3.jar:1.14.3]
      at org.apache.flink.runtime.concurrent.akka.ClassLoadingUtils.lambda$null$1(ClassLoadingUtils.java:93) ~[?:?]
      at org.apache.flink.runtime.concurrent.akka.ClassLoadingUtils.runWithContextClassLoader(ClassLoadingUtils.java:68) ~[?:?]
      at org.apache.flink.runtime.concurrent.akka.ClassLoadingUtils.lambda$guardCompletionWithContextClassLoader$2(ClassLoadingUtils.java:92) ~[?:?]
      at java.util.concurrent.CompletableFuture.uniWhenComplete(Unknown Source) ~[?:?]
      at java.util.concurrent.CompletableFuture$UniWhenComplete.tryFire(Unknown Source) ~[?:?]
      at java.util.concurrent.CompletableFuture.postComplete(Unknown Source) ~[?:?]
      at java.util.concurrent.CompletableFuture.complete(Unknown Source) ~[?:?]
      at org.apache.flink.runtime.concurrent.akka.AkkaFutureUtils$1.onComplete(AkkaFutureUtils.java:47) ~[?:?]
      at akka.dispatch.OnComplete.internal(Future.scala:300) ~[?:?]
      at akka.dispatch.OnComplete.internal(Future.scala:297) ~[?:?]
      at akka.dispatch.japi$CallbackBridge.apply(Future.scala:224) ~[?:?]
      at akka.dispatch.japi$CallbackBridge.apply(Future.scala:221) ~[?:?]
      at scala.concurrent.impl.CallbackRunnable.run(Promise.scala:60) ~[flink-dist_2.11-1.14.3.jar:1.14.3]
      at org.apache.flink.runtime.concurrent.akka.AkkaFutureUtils$DirectExecutionContext.execute(AkkaFutureUtils.java:65) ~[?:?]
      at scala.concurrent.impl.CallbackRunnable.executeWithValue(Promise.scala:68) ~[flink-dist_2.11-1.14.3.jar:1.14.3]
      at scala.concurrent.impl.Promise$DefaultPromise.$anonfun$tryComplete$1(Promise.scala:284) ~[flink-dist_2.11-1.14.3.jar:1.14.3]
      at scala.concurrent.impl.Promise$DefaultPromise.$anonfun$tryComplete$1$adapted(Promise.scala:284) ~[flink-dist_2.11-1.14.3.jar:1.14.3]
      at scala.concurrent.impl.Promise$DefaultPromise.tryComplete(Promise.scala:284) ~[flink-dist_2.11-1.14.3.jar:1.14.3]
      at akka.pattern.PromiseActorRef.$bang(AskSupport.scala:621) ~[?:?]
      at akka.pattern.PipeToSupport$PipeableFuture$$anonfun$pipeTo$1.applyOrElse(PipeToSupport.scala:24) ~[?:?]
      at akka.pattern.PipeToSupport$PipeableFuture$$anonfun$pipeTo$1.applyOrElse(PipeToSupport.scala:23) ~[?:?]
      at scala.concurrent.Future.$anonfun$andThen$1(Future.scala:532) ~[flink-dist_2.11-1.14.3.jar:1.14.3]
      at scala.concurrent.impl.Promise.liftedTree1$1(Promise.scala:29) ~[flink-dist_2.11-1.14.3.jar:1.14.3]
      at scala.concurrent.impl.Promise.$anonfun$transform$1(Promise.scala:29) ~[flink-dist_2.11-1.14.3.jar:1.14.3]
      at scala.concurrent.impl.CallbackRunnable.run(Promise.scala:60) ~[flink-dist_2.11-1.14.3.jar:1.14.3]
      at akka.dispatch.BatchingExecutor$AbstractBatch.processBatch(BatchingExecutor.scala:63) ~[?:?]
      at akka.dispatch.BatchingExecutor$BlockableBatch.$anonfun$run$1(BatchingExecutor.scala:100) ~[?:?]
      at scala.runtime.java8.JFunction0$mcV$sp.apply(JFunction0$mcV$sp.java:12) ~[?:?]
      at scala.concurrent.BlockContext$.withBlockContext(BlockContext.scala:81) ~[flink-dist_2.11-1.14.3.jar:1.14.3]
      at akka.dispatch.BatchingExecutor$BlockableBatch.run(BatchingExecutor.scala:100) ~[?:?]
      at akka.dispatch.TaskInvocation.run(AbstractDispatcher.scala:49) ~[?:?]
      at akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask.exec(ForkJoinExecutorConfigurator.scala:48) ~[?:?]
      at java.util.concurrent.ForkJoinTask.doExec(Unknown Source) [?:?]
      at java.util.concurrent.ForkJoinPool$WorkQueue.topLevelExec(Unknown Source) [?:?]
      at java.util.concurrent.ForkJoinPool.scan(Unknown Source) [?:?]
      at java.util.concurrent.ForkJoinPool.runWorker(Unknown Source) [?:?]
      at java.util.concurrent.ForkJoinWorkerThread.run(Unknown Source)
      Caused by: java.io.NotSerializableException: java.util.HashMap$Values
      at java.io.ObjectOutputStream.writeObject0(Unknown Source) ~[?:?]
      at java.io.ObjectOutputStream.defaultWriteFields(Unknown Source) ~[?:?]
      at java.io.ObjectOutputStream.writeSerialData(Unknown Source) ~[?:?]
      at java.io.ObjectOutputStream.writeOrdinaryObject(Unknown Source) ~[?:?]
      at java.io.ObjectOutputStream.writeObject0(Unknown Source) ~[?:?]
      at java.io.ObjectOutputStream.writeObject(Unknown Source) ~[?:?]
      at org.apache.flink.util.InstantiationUtil.serializeObject(InstantiationUtil.java:632) ~[flink-dist_2.11-1.14.3.jar:1.14.3]
      at org.apache.flink.runtime.rpc.akka.AkkaRpcSerializedValue.valueOf(AkkaRpcSerializedValue.java:66) ~[?:?]
      at org.apache.flink.runtime.rpc.akka.AkkaRpcActor.serializeRemoteResultAndVerifySize(AkkaRpcActor.java:400) ~[?:?]
      ... 54 more
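
      The `Caused by` line names `java.util.HashMap$Values`, the view returned by `HashMap.values()`, which does not implement `java.io.Serializable`. The following is a minimal standalone sketch of that behaviour using plain Java serialization. It is not Flink code: the class name `ValuesViewSerializationDemo`, the helper `trySerialize`, and the sample map are hypothetical, and it only assumes that the RPC result holds such a view somewhere in its object graph.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.NotSerializableException;
import java.io.ObjectOutputStream;
import java.util.ArrayList;
import java.util.Collection;
import java.util.HashMap;
import java.util.Map;

// Hypothetical demo class; it does not reproduce Flink's RPC layer.
public class ValuesViewSerializationDemo {

    /**
     * Tries to serialize the value with standard Java serialization.
     * Returns null on success, or the class name carried by the
     * NotSerializableException if the value cannot be serialized.
     */
    public static String trySerialize(Object value) throws IOException {
        try (ObjectOutputStream out = new ObjectOutputStream(new ByteArrayOutputStream())) {
            out.writeObject(value);
            return null;
        } catch (NotSerializableException e) {
            return e.getMessage();
        }
    }

    public static void main(String[] args) throws IOException {
        // Sample data standing in for a map of job details (assumption).
        Map<String, Integer> jobs = new HashMap<>();
        jobs.put("job-a", 1);

        // The values() view is backed by the map and is not Serializable.
        Collection<Integer> view = jobs.values();
        System.out.println(trySerialize(view)); // prints java.util.HashMap$Values

        // Copying the view into an ArrayList yields a serializable collection.
        System.out.println(trySerialize(new ArrayList<>(view))); // prints null
    }
}
```

      Copying the view into a concrete collection such as `ArrayList` before it is handed to serialization avoids the exception in this sketch. Whether that matches the actual fix tracked by the duplicate issue is not stated here.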

            People

              Assignee: Unassigned
              Reporter: Jeremy DeGroot (jeremy.degroot)
              Votes: 0
              Watchers: 1
