Uploaded image for project: 'Beam'
  1. Beam
  2. BEAM-6289

Running a join on two Cassandra tables using FlinkRunner fails

Details

    • Bug
    • Status: Resolved
    • P1
    • Resolution: Workaround
    • 2.8.0, 2.9.0
    • Not applicable

    Description

      Can't make a simple join on two Cassandra tables when using FlinkRunner.

      The same code works with a DirectRunner fails when used with FlinkRunner giving these (as well as many other) errors:

      Caused by: akka.pattern.AskTimeoutException: Ask timed out on [Actor[akka://flink/user/dispatchere1f5abe7-6299-43ea-9182-24a2193e078f#-1757043920]] after [10000 ms]. Sender[null] sent message of type "org.apache.flink.runtime.rpc.messages.LocalFencedMessage".
          at akka.pattern.PromiseActorRef$$anonfun$1.apply$mcV$sp(AskSupport.scala:604)
          at akka.actor.Scheduler$$anon$4.run(Scheduler.scala:126)
          at scala.concurrent.Future$InternalCallbackExecutor$.unbatchedExecute(Future.scala:601)
          at scala.concurrent.BatchingExecutor$class.execute(BatchingExecutor.scala:109)
          at scala.concurrent.Future$InternalCallbackExecutor$.execute(Future.scala:599)
          at akka.actor.LightArrayRevolverScheduler$TaskHolder.executeTask(LightArrayRevolverScheduler.scala:329)
          at akka.actor.LightArrayRevolverScheduler$$anon$4.executeBucket$1(LightArrayRevolverScheduler.scala:280)
          at akka.actor.LightArrayRevolverScheduler$$anon$4.nextTick(LightArrayRevolverScheduler.scala:284)
          at akka.actor.LightArrayRevolverScheduler$$anon$4.run(LightArrayRevolverScheduler.scala:236)
          at java.lang.Thread.run(Thread.java:748)
      

       

      The code can be found here

      Steps to reproduce:

      1. Clone the repo to a linux (I;m on Ubuntu 18 but any *nix system would probably work - i.e. repl.it)
      2. Follow the README to set up a Cassandra container + schema
      3. Run with 
        gradle --console=plain join-from-cassandra -Drunner=flink > output/build.log 2>&1

        to use FlinkRunner. See error in log at ./output/build.log

      1. Run with 
        gradle --console=plain join-from-cassandra -Drunner=direct > output/build.log 2>&1

        to use DirectRunner. See error in log at ./output/build.log

      Attachments

        1. direct_runner_build.log
          9 kB
          Shahar Frank
        2. flink_runner_build.log
          147 kB
          Shahar Frank

        Activity

          People

            mxm Maximilian Michels
            srfrnk Shahar Frank
            Votes:
            1 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: