Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-17006

Various integration tests in table module fail with FileNotFoundException (in Rocksdb)

    XMLWordPrintableJSON

Details

    Description

      AggregateITCase.testDistinctGroupBy fails with FileNotFoundException (in Rocksdb)

      CI run: https://dev.azure.com/rmetzger/Flink/_build/results?buildId=7045&view=logs&j=e25d5e7e-2a9c-5589-4940-0b638d75a414&t=294c2388-20e6-57a2-5721-91db544b1e69
      Log output:

      2020-04-03T17:17:44.4036304Z [ERROR] Tests run: 234, Failures: 0, Errors: 1, Skipped: 6, Time elapsed: 155.577 s <<< FAILURE! - in org.apache.flink.table.planner.runtime.stream.sql.AggregateITCase
      2020-04-03T17:17:44.4038781Z [ERROR] testDistinctGroupBy[LocalGlobal=OFF, MiniBatch=ON, StateBackend=ROCKSDB](org.apache.flink.table.planner.runtime.stream.sql.AggregateITCase)  Time elapsed: 0.456 s  <<< ERROR!
      2020-04-03T17:17:44.4040384Z org.apache.flink.runtime.client.JobExecutionException: Job execution failed.
      2020-04-03T17:17:44.4041520Z 	at org.apache.flink.runtime.jobmaster.JobResult.toJobExecutionResult(JobResult.java:147)
      2020-04-03T17:17:44.4042712Z 	at org.apache.flink.runtime.minicluster.MiniCluster.executeJobBlocking(MiniCluster.java:659)
      2020-04-03T17:17:44.4043972Z 	at org.apache.flink.streaming.util.TestStreamEnvironment.execute(TestStreamEnvironment.java:77)
      2020-04-03T17:17:44.4045540Z 	at org.apache.flink.streaming.api.environment.StreamExecutionEnvironment.execute(StreamExecutionEnvironment.java:1644)
      2020-04-03T17:17:44.4047015Z 	at org.apache.flink.streaming.api.environment.StreamExecutionEnvironment.execute(StreamExecutionEnvironment.java:1626)
      2020-04-03T17:17:44.4048576Z 	at org.apache.flink.streaming.api.scala.StreamExecutionEnvironment.execute(StreamExecutionEnvironment.scala:673)
      2020-04-03T17:17:44.4050073Z 	at org.apache.flink.table.planner.runtime.stream.sql.AggregateITCase.testDistinctGroupBy(AggregateITCase.scala:172)
      2020-04-03T17:17:44.4051200Z 	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
      2020-04-03T17:17:44.4052171Z 	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
      2020-04-03T17:17:44.4053308Z 	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
      2020-04-03T17:17:44.4054322Z 	at java.lang.reflect.Method.invoke(Method.java:498)
      2020-04-03T17:17:44.4055410Z 	at org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:50)
      2020-04-03T17:17:44.4056570Z 	at org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
      2020-04-03T17:17:44.4057800Z 	at org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:47)
      2020-04-03T17:17:44.4059019Z 	at org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
      2020-04-03T17:17:44.4060178Z 	at org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
      2020-04-03T17:17:44.4061261Z 	at org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
      2020-04-03T17:17:44.4062617Z 	at org.junit.rules.ExpectedException$ExpectedExceptionStatement.evaluate(ExpectedException.java:239)
      2020-04-03T17:17:44.4063782Z 	at org.junit.rules.ExternalResource$1.evaluate(ExternalResource.java:48)
      2020-04-03T17:17:44.4064838Z 	at org.junit.rules.TestWatcher$1.evaluate(TestWatcher.java:55)
      2020-04-03T17:17:44.4065742Z 	at org.junit.rules.RunRules.evaluate(RunRules.java:20)
      2020-04-03T17:17:44.4066636Z 	at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:325)
      2020-04-03T17:17:44.4067762Z 	at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:78)
      2020-04-03T17:17:44.4068895Z 	at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:57)
      2020-04-03T17:17:44.4069978Z 	at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
      2020-04-03T17:17:44.4070920Z 	at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71)
      2020-04-03T17:17:44.4071901Z 	at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:288)
      2020-04-03T17:17:44.4072875Z 	at org.junit.runners.ParentRunner.access$000(ParentRunner.java:58)
      2020-04-03T17:17:44.4073850Z 	at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:268)
      2020-04-03T17:17:44.4074854Z 	at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
      2020-04-03T17:17:44.4075729Z 	at org.junit.runners.Suite.runChild(Suite.java:128)
      2020-04-03T17:17:44.4076541Z 	at org.junit.runners.Suite.runChild(Suite.java:27)
      2020-04-03T17:17:44.4077479Z 	at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
      2020-04-03T17:17:44.4078422Z 	at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71)
      2020-04-03T17:17:44.4079501Z 	at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:288)
      2020-04-03T17:17:44.4080503Z 	at org.junit.runners.ParentRunner.access$000(ParentRunner.java:58)
      2020-04-03T17:17:44.4081483Z 	at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:268)
      2020-04-03T17:17:44.4082477Z 	at org.junit.rules.ExternalResource$1.evaluate(ExternalResource.java:48)
      2020-04-03T17:17:44.4083522Z 	at org.junit.rules.ExternalResource$1.evaluate(ExternalResource.java:48)
      2020-04-03T17:17:44.4084529Z 	at org.junit.rules.RunRules.evaluate(RunRules.java:20)
      2020-04-03T17:17:44.4085420Z 	at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
      2020-04-03T17:17:44.4086433Z 	at org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:365)
      2020-04-03T17:17:44.4087696Z 	at org.apache.maven.surefire.junit4.JUnit4Provider.executeWithRerun(JUnit4Provider.java:273)
      2020-04-03T17:17:44.4088900Z 	at org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:238)
      2020-04-03T17:17:44.4090109Z 	at org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:159)
      2020-04-03T17:17:44.4091331Z 	at org.apache.maven.surefire.booter.ForkedBooter.invokeProviderInSameClassLoader(ForkedBooter.java:384)
      2020-04-03T17:17:44.4092600Z 	at org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:345)
      2020-04-03T17:17:44.4093737Z 	at org.apache.maven.surefire.booter.ForkedBooter.execute(ForkedBooter.java:126)
      2020-04-03T17:17:44.4094894Z 	at org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:418)
      2020-04-03T17:17:44.4096257Z Caused by: org.apache.flink.runtime.JobException: Recovery is suppressed by FixedDelayRestartBackoffTimeStrategy(maxNumberRestartAttempts=1, backoffTimeMS=0)
      2020-04-03T17:17:44.4097915Z 	at org.apache.flink.runtime.executiongraph.failover.flip1.ExecutionFailureHandler.handleFailure(ExecutionFailureHandler.java:110)
      2020-04-03T17:17:44.4099539Z 	at org.apache.flink.runtime.executiongraph.failover.flip1.ExecutionFailureHandler.getFailureHandlingResult(ExecutionFailureHandler.java:76)
      2020-04-03T17:17:44.4101039Z 	at org.apache.flink.runtime.scheduler.DefaultScheduler.handleTaskFailure(DefaultScheduler.java:190)
      2020-04-03T17:17:44.4102353Z 	at org.apache.flink.runtime.scheduler.DefaultScheduler.maybeHandleTaskFailure(DefaultScheduler.java:184)
      2020-04-03T17:17:44.4103808Z 	at org.apache.flink.runtime.scheduler.DefaultScheduler.updateTaskExecutionStateInternal(DefaultScheduler.java:178)
      2020-04-03T17:17:44.4105242Z 	at org.apache.flink.runtime.scheduler.SchedulerBase.updateTaskExecutionState(SchedulerBase.java:505)
      2020-04-03T17:17:44.4106478Z 	at org.apache.flink.runtime.jobmaster.JobMaster.updateTaskExecutionState(JobMaster.java:384)
      2020-04-03T17:17:44.4107561Z 	at sun.reflect.GeneratedMethodAccessor16.invoke(Unknown Source)
      2020-04-03T17:17:44.4108552Z 	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
      2020-04-03T17:17:44.4109560Z 	at java.lang.reflect.Method.invoke(Method.java:498)
      2020-04-03T17:17:44.4110604Z 	at org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleRpcInvocation(AkkaRpcActor.java:284)
      2020-04-03T17:17:44.4111812Z 	at org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleRpcMessage(AkkaRpcActor.java:199)
      2020-04-03T17:17:44.4113054Z 	at org.apache.flink.runtime.rpc.akka.FencedAkkaRpcActor.handleRpcMessage(FencedAkkaRpcActor.java:74)
      2020-04-03T17:17:44.4114282Z 	at org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleMessage(AkkaRpcActor.java:152)
      2020-04-03T17:17:44.4115417Z 	at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:26)
      2020-04-03T17:17:44.4116357Z 	at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:21)
      2020-04-03T17:17:44.4117338Z 	at scala.PartialFunction$class.applyOrElse(PartialFunction.scala:123)
      2020-04-03T17:17:44.4118401Z 	at akka.japi.pf.UnitCaseStatement.applyOrElse(CaseStatements.scala:21)
      2020-04-03T17:17:44.4119414Z 	at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:170)
      2020-04-03T17:17:44.4120443Z 	at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:171)
      2020-04-03T17:17:44.4121538Z 	at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:171)
      2020-04-03T17:17:44.4122461Z 	at akka.actor.Actor$class.aroundReceive(Actor.scala:517)
      2020-04-03T17:17:44.4123383Z 	at akka.actor.AbstractActor.aroundReceive(AbstractActor.scala:225)
      2020-04-03T17:17:44.4124315Z 	at akka.actor.ActorCell.receiveMessage(ActorCell.scala:592)
      2020-04-03T17:17:44.4125257Z 	at akka.actor.ActorCell.invoke(ActorCell.scala:561)
      2020-04-03T17:17:44.4126107Z 	at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:258)
      2020-04-03T17:17:44.4126953Z 	at akka.dispatch.Mailbox.run(Mailbox.scala:225)
      2020-04-03T17:17:44.4127798Z 	at akka.dispatch.Mailbox.exec(Mailbox.scala:235)
      2020-04-03T17:17:44.4128829Z 	at akka.dispatch.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
      2020-04-03T17:17:44.4129875Z 	at akka.dispatch.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339)
      2020-04-03T17:17:44.4130957Z 	at akka.dispatch.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
      2020-04-03T17:17:44.4132016Z 	at akka.dispatch.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
      2020-04-03T17:17:44.4133098Z Caused by: java.lang.Exception: Exception while creating StreamOperatorStateContext.
      2020-04-03T17:17:44.4134528Z 	at org.apache.flink.streaming.api.operators.StreamTaskStateInitializerImpl.streamOperatorStateContext(StreamTaskStateInitializerImpl.java:191)
      2020-04-03T17:17:44.4136100Z 	at org.apache.flink.streaming.api.operators.AbstractStreamOperator.initializeState(AbstractStreamOperator.java:246)
      2020-04-03T17:17:44.4137580Z 	at org.apache.flink.streaming.runtime.tasks.OperatorChain.initializeStateAndOpenOperators(OperatorChain.java:293)
      2020-04-03T17:17:44.4138925Z 	at org.apache.flink.streaming.runtime.tasks.StreamTask.lambda$beforeInvoke$0(StreamTask.java:436)
      2020-04-03T17:17:44.4140304Z 	at org.apache.flink.streaming.runtime.tasks.StreamTaskActionExecutor$1.runThrowing(StreamTaskActionExecutor.java:47)
      2020-04-03T17:17:44.4141631Z 	at org.apache.flink.streaming.runtime.tasks.StreamTask.beforeInvoke(StreamTask.java:432)
      2020-04-03T17:17:44.4142783Z 	at org.apache.flink.streaming.runtime.tasks.StreamTask.invoke(StreamTask.java:445)
      2020-04-03T17:17:44.4143814Z 	at org.apache.flink.runtime.taskmanager.Task.doRun(Task.java:718)
      2020-04-03T17:17:44.4144935Z 	at org.apache.flink.runtime.taskmanager.Task.run(Task.java:542)
      2020-04-03T17:17:44.4145756Z 	at java.lang.Thread.run(Thread.java:748)
      2020-04-03T17:17:44.4147095Z Caused by: org.apache.flink.util.FlinkException: Could not restore keyed state backend for KeyedMapBundleOperator_f6dc7f4d2283f4605b127b9364e21148_(2/4) from any of the 1 provided restore options.
      2020-04-03T17:17:44.4148863Z 	at org.apache.flink.streaming.api.operators.BackendRestorerProcedure.createAndRestore(BackendRestorerProcedure.java:135)
      2020-04-03T17:17:44.4150449Z 	at org.apache.flink.streaming.api.operators.StreamTaskStateInitializerImpl.keyedStatedBackend(StreamTaskStateInitializerImpl.java:304)
      2020-04-03T17:17:44.4152096Z 	at org.apache.flink.streaming.api.operators.StreamTaskStateInitializerImpl.streamOperatorStateContext(StreamTaskStateInitializerImpl.java:131)
      2020-04-03T17:17:44.4153151Z 	... 9 more
      2020-04-03T17:17:44.4153946Z Caused by: org.apache.flink.runtime.state.BackendBuildingException: Caught unexpected exception.
      2020-04-03T17:17:44.4155379Z 	at org.apache.flink.contrib.streaming.state.RocksDBKeyedStateBackendBuilder.build(RocksDBKeyedStateBackendBuilder.java:336)
      2020-04-03T17:17:44.4156850Z 	at org.apache.flink.contrib.streaming.state.RocksDBStateBackend.createKeyedStateBackend(RocksDBStateBackend.java:548)
      2020-04-03T17:17:44.4158503Z 	at org.apache.flink.streaming.api.operators.StreamTaskStateInitializerImpl.lambda$keyedStatedBackend$1(StreamTaskStateInitializerImpl.java:288)
      2020-04-03T17:17:44.4160158Z 	at org.apache.flink.streaming.api.operators.BackendRestorerProcedure.attemptCreateAndRestore(BackendRestorerProcedure.java:142)
      2020-04-03T17:17:44.4161662Z 	at org.apache.flink.streaming.api.operators.BackendRestorerProcedure.createAndRestore(BackendRestorerProcedure.java:121)
      2020-04-03T17:17:44.4162706Z 	... 11 more
      2020-04-03T17:17:44.4164646Z Caused by: java.io.FileNotFoundException: /tmp/junit1553841028375950249/junit3479071836389613442/babdd750dc1c5b3874a0dd55d14a84f6/shared/2aa67d1b-8841-4755-84c4-b891fc8c3352 (No such file or directory)
      2020-04-03T17:17:44.4166003Z 	at java.io.FileInputStream.open0(Native Method)
      2020-04-03T17:17:44.4166782Z 	at java.io.FileInputStream.open(FileInputStream.java:195)
      2020-04-03T17:17:44.4167752Z 	at java.io.FileInputStream.<init>(FileInputStream.java:138)
      2020-04-03T17:17:44.4168802Z 	at org.apache.flink.core.fs.local.LocalDataInputStream.<init>(LocalDataInputStream.java:50)
      2020-04-03T17:17:44.4170003Z 	at org.apache.flink.core.fs.local.LocalFileSystem.open(LocalFileSystem.java:142)
      2020-04-03T17:17:44.4171168Z 	at org.apache.flink.core.fs.SafetyNetWrapperFileSystem.open(SafetyNetWrapperFileSystem.java:85)
      2020-04-03T17:17:44.4172447Z 	at org.apache.flink.runtime.state.filesystem.FileStateHandle.openInputStream(FileStateHandle.java:68)
      2020-04-03T17:17:44.4173862Z 	at org.apache.flink.contrib.streaming.state.RocksDBStateDownloader.downloadDataForStateHandle(RocksDBStateDownloader.java:126)
      2020-04-03T17:17:44.4175519Z 	at org.apache.flink.contrib.streaming.state.RocksDBStateDownloader.lambda$createDownloadRunnables$0(RocksDBStateDownloader.java:109)
      2020-04-03T17:17:44.4176945Z 	at org.apache.flink.util.function.ThrowingRunnable.lambda$unchecked$0(ThrowingRunnable.java:50)
      2020-04-03T17:17:44.4178185Z 	at java.util.concurrent.CompletableFuture$AsyncRun.run(CompletableFuture.java:1640)
      2020-04-03T17:17:44.4179404Z 	at org.apache.flink.runtime.concurrent.DirectExecutorService.execute(DirectExecutorService.java:211)
      2020-04-03T17:17:44.4180647Z 	at java.util.concurrent.CompletableFuture.asyncRunStage(CompletableFuture.java:1654)
      2020-04-03T17:17:44.4181769Z 	at java.util.concurrent.CompletableFuture.runAsync(CompletableFuture.java:1871)
      2020-04-03T17:17:44.4183098Z 	at org.apache.flink.contrib.streaming.state.RocksDBStateDownloader.downloadDataForAllStateHandles(RocksDBStateDownloader.java:83)
      2020-04-03T17:17:44.4184752Z 	at org.apache.flink.contrib.streaming.state.RocksDBStateDownloader.transferAllStateDataToDirectory(RocksDBStateDownloader.java:67)
      2020-04-03T17:17:44.4186598Z 	at org.apache.flink.contrib.streaming.state.restore.RocksDBIncrementalRestoreOperation.transferRemoteStateToLocalDirectory(RocksDBIncrementalRestoreOperation.java:229)
      2020-04-03T17:17:44.4188548Z 	at org.apache.flink.contrib.streaming.state.restore.RocksDBIncrementalRestoreOperation.restoreFromRemoteState(RocksDBIncrementalRestoreOperation.java:194)
      2020-04-03T17:17:44.4190380Z 	at org.apache.flink.contrib.streaming.state.restore.RocksDBIncrementalRestoreOperation.restoreWithoutRescaling(RocksDBIncrementalRestoreOperation.java:168)
      2020-04-03T17:17:44.4192129Z 	at org.apache.flink.contrib.streaming.state.restore.RocksDBIncrementalRestoreOperation.restore(RocksDBIncrementalRestoreOperation.java:154)
      2020-04-03T17:17:44.4193725Z 	at org.apache.flink.contrib.streaming.state.RocksDBKeyedStateBackendBuilder.build(RocksDBKeyedStateBackendBuilder.java:279)
      2020-04-03T17:17:44.4194758Z 	... 15 more
      2020-04-03T17:17:44.4195053Z 
      

      I'm uncertain about the component assignment of this ticket. This error can probably have many causes?

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              rmetzger Robert Metzger
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: