Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-26833

missing link to setup-pyflink-virtual-env.sh / error during deploy

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 1.14.4
    • None
    • API / Python, Documentation
    • None

    Description

      When I navigate to https://nightlies.apache.org/flink/flink-docs-release-1.14/docs/dev/python/faq/#preparing-python-virtual-environment there is not a functioning link to the `setup-pyflink-virtual-env.sh` script.

       

      When I try to deploy to a remote cluster with a virtualenv that is created by pycharm I see the following error:

       

       

      (venv) chris@chrisvb~/PycharmProjects/mythril_pyflink$ /home/chris/PycharmProjects/mythril_pyflink/venv/lib/python3.8/site-packages/pyflink/bin/flink run --jobmanager flinkmaster.myxyzdomain.com:8081 --python timescale_profit_calc_stream.py
      WARNING: An illegal reflective access operation has occurred
      WARNING: Illegal reflective access by org.apache.flink.api.java.ClosureCleaner (file:/home/chris/PycharmProjects/mythril_pyflink/venv/lib/python3.8/site-packages/pyflink/lib/flink-dist_2.11-1.14.3.jar) to field java.util.Properties.serialVersionUID
      WARNING: Please consider reporting this to the maintainers of org.apache.flink.api.java.ClosureCleaner
      WARNING: Use --illegal-access=warn to enable warnings of further illegal reflective access operations
      WARNING: All illegal access operations will be denied in a future release
      Job has been submitted with JobID fea43e68c13d0cc3183a6ad6d6157748
      Traceback (most recent call last):
        File "timescale_profit_calc_stream.py", line 169, in <module>
          execute_timescale_profit_calc_stream()
        File "timescale_profit_calc_stream.py", line 157, in execute_timescale_profit_calc_stream
          env.execute()
        File "/home/chris/PycharmProjects/mythril_pyflink/venv/lib/python3.8/site-packages/pyflink/datastream/stream_execution_environment.py", line 691, in execute
          return JobExecutionResult(self._j_stream_execution_environment.execute(j_stream_graph))
        File "/home/chris/PycharmProjects/mythril_pyflink/venv/lib/python3.8/site-packages/py4j/java_gateway.py", line 1285, in _call_
          return_value = get_return_value(
        File "/home/chris/PycharmProjects/mythril_pyflink/venv/lib/python3.8/site-packages/pyflink/util/exceptions.py", line 146, in deco
          return f(*a, **kw)
        File "/home/chris/PycharmProjects/mythril_pyflink/venv/lib/python3.8/site-packages/py4j/protocol.py", line 326, in get_return_value
          raise Py4JJavaError(
      py4j.protocol.Py4JJavaError: An error occurred while calling o10.execute.
      : java.util.concurrent.ExecutionException: org.apache.flink.client.program.ProgramInvocationException: Job failed (JobID: fea43e68c13d0cc3183a6ad6d6157748)
              at java.base/java.util.concurrent.CompletableFuture.reportGet(CompletableFuture.java:395)
              at java.base/java.util.concurrent.CompletableFuture.get(CompletableFuture.java:1999)
              at org.apache.flink.client.program.StreamContextEnvironment.getJobExecutionResult(StreamContextEnvironment.java:123)
              at org.apache.flink.client.program.StreamContextEnvironment.execute(StreamContextEnvironment.java:80)
              at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
              at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
              at java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
              at java.base/java.lang.reflect.Method.invoke(Method.java:566)
              at org.apache.flink.api.python.shaded.py4j.reflection.MethodInvoker.invoke(MethodInvoker.java:244)
              at org.apache.flink.api.python.shaded.py4j.reflection.ReflectionEngine.invoke(ReflectionEngine.java:357)
              at org.apache.flink.api.python.shaded.py4j.Gateway.invoke(Gateway.java:282)
              at org.apache.flink.api.python.shaded.py4j.commands.AbstractCommand.invokeMethod(AbstractCommand.java:132)
              at org.apache.flink.api.python.shaded.py4j.commands.CallCommand.execute(CallCommand.java:79)
              at org.apache.flink.api.python.shaded.py4j.GatewayConnection.run(GatewayConnection.java:238)
              at java.base/java.lang.Thread.run(Thread.java:829)
      Caused by: org.apache.flink.client.program.ProgramInvocationException: Job failed (JobID: fea43e68c13d0cc3183a6ad6d6157748)
              at org.apache.flink.client.deployment.ClusterClientJobClientAdapter.lambda$null$6(ClusterClientJobClientAdapter.java:125)
              at java.base/java.util.concurrent.CompletableFuture$UniApply.tryFire(CompletableFuture.java:642)
              at java.base/java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:506)
              at java.base/java.util.concurrent.CompletableFuture.complete(CompletableFuture.java:2073)
              at org.apache.flink.util.concurrent.FutureUtils.lambda$retryOperationWithDelay$9(FutureUtils.java:403)
              at java.base/java.util.concurrent.CompletableFuture.uniWhenComplete(CompletableFuture.java:859)
              at java.base/java.util.concurrent.CompletableFuture$UniWhenComplete.tryFire(CompletableFuture.java:837)
              at java.base/java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:506)
              at java.base/java.util.concurrent.CompletableFuture.complete(CompletableFuture.java:2073)
              at org.apache.flink.client.program.rest.RestClusterClient.lambda$pollResourceAsync$26(RestClusterClient.java:698)
              at java.base/java.util.concurrent.CompletableFuture.uniWhenComplete(CompletableFuture.java:859)
              at java.base/java.util.concurrent.CompletableFuture$UniWhenComplete.tryFire(CompletableFuture.java:837)
              at java.base/java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:506)
              at java.base/java.util.concurrent.CompletableFuture.complete(CompletableFuture.java:2073)
              at org.apache.flink.util.concurrent.FutureUtils.lambda$retryOperationWithDelay$9(FutureUtils.java:403)
              at java.base/java.util.concurrent.CompletableFuture.uniWhenComplete(CompletableFuture.java:859)
              at java.base/java.util.concurrent.CompletableFuture$UniWhenComplete.tryFire(CompletableFuture.java:837)
              at java.base/java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:506)
              at java.base/java.util.concurrent.CompletableFuture.postFire(CompletableFuture.java:610)
              at java.base/java.util.concurrent.CompletableFuture$UniCompose.tryFire(CompletableFuture.java:1085)
              at java.base/java.util.concurrent.CompletableFuture$Completion.run(CompletableFuture.java:478)
              at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
              at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
              ... 1 more
      Caused by: org.apache.flink.runtime.client.JobExecutionException: Job execution failed.
              at org.apache.flink.runtime.jobmaster.JobResult.toJobExecutionResult(JobResult.java:144)
              at org.apache.flink.client.deployment.ClusterClientJobClientAdapter.lambda$null$6(ClusterClientJobClientAdapter.java:123)
              ... 23 more
      Caused by: org.apache.flink.runtime.JobException: Recovery is suppressed by NoRestartBackoffTimeStrategy
              at org.apache.flink.runtime.executiongraph.failover.flip1.ExecutionFailureHandler.handleFailure(ExecutionFailureHandler.java:138)
              at org.apache.flink.runtime.executiongraph.failover.flip1.ExecutionFailureHandler.getFailureHandlingResult(ExecutionFailureHandler.java:82)
              at org.apache.flink.runtime.scheduler.DefaultScheduler.handleTaskFailure(DefaultScheduler.java:252)
              at org.apache.flink.runtime.scheduler.DefaultScheduler.maybeHandleTaskFailure(DefaultScheduler.java:242)
              at org.apache.flink.runtime.scheduler.DefaultScheduler.updateTaskExecutionStateInternal(DefaultScheduler.java:233)
              at org.apache.flink.runtime.scheduler.SchedulerBase.updateTaskExecutionState(SchedulerBase.java:684)
              at org.apache.flink.runtime.scheduler.SchedulerNG.updateTaskExecutionState(SchedulerNG.java:79)
              at org.apache.flink.runtime.jobmaster.JobMaster.updateTaskExecutionState(JobMaster.java:444)
              at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
              at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
              at java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
              at java.base/java.lang.reflect.Method.invoke(Method.java:566)
              at org.apache.flink.runtime.rpc.akka.AkkaRpcActor.lambda$handleRpcInvocation$1(AkkaRpcActor.java:316)
              at org.apache.flink.runtime.concurrent.akka.ClassLoadingUtils.runWithContextClassLoader(ClassLoadingUtils.java:83)
              at org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleRpcInvocation(AkkaRpcActor.java:314)
              at org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleRpcMessage(AkkaRpcActor.java:217)
              at org.apache.flink.runtime.rpc.akka.FencedAkkaRpcActor.handleRpcMessage(FencedAkkaRpcActor.java:78)
              at org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleMessage(AkkaRpcActor.java:163)
              at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:24)
              at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:20)
              at scala.PartialFunction.applyOrElse(PartialFunction.scala:123)
              at scala.PartialFunction.applyOrElse$(PartialFunction.scala:122)
              at akka.japi.pf.UnitCaseStatement.applyOrElse(CaseStatements.scala:20)
              at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:171)
              at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:172)
              at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:172)
              at akka.actor.Actor.aroundReceive(Actor.scala:537)
              at akka.actor.Actor.aroundReceive$(Actor.scala:535)
              at akka.actor.AbstractActor.aroundReceive(AbstractActor.scala:220)
              at akka.actor.ActorCell.receiveMessage(ActorCell.scala:580)
              at akka.actor.ActorCell.invoke(ActorCell.scala:548)
              at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:270)
              at akka.dispatch.Mailbox.run(Mailbox.scala:231)
              at akka.dispatch.Mailbox.exec(Mailbox.scala:243)
              at java.base/java.util.concurrent.ForkJoinTask.doExec(ForkJoinTask.java:290)
              at java.base/java.util.concurrent.ForkJoinPool$WorkQueue.topLevelExec(ForkJoinPool.java:1020)
              at java.base/java.util.concurrent.ForkJoinPool.scan(ForkJoinPool.java:1656)
              at java.base/java.util.concurrent.ForkJoinPool.runWorker(ForkJoinPool.java:1594)
              at java.base/java.util.concurrent.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:183)
      Caused by: java.io.IOException: Failed to execute the command: venv.zip/bin/python -c import sys;from distutils.dist import Distribution;install_obj = Distribution().get_command_obj('install', create=True);install_obj.prefix = sys.argv[1];install_obj.finalize_options();installed_dir = [install_obj.install_purelib];install_obj.install_purelib != install_obj.install_platlib and installed_dir.append(install_obj.install_platlib);print(installed_dir[0]);len(installed_dir) > 1 and print(installed_dir[1]) /tmp/python-dist-c4a400d8-1cbb-4b47-b667-a29eb098405d/python-requirements
      output: Could not find platform independent libraries <prefix>
      Consider setting $PYTHONHOME to <prefix>[:<exec_prefix>]
      Python path configuration:
        PYTHONHOME = (not set)
        PYTHONPATH = (not set)
        program name = 'venv.zip/bin/python'
        isolated = 0
        environment = 1
        user site = 1
        import site = 1
        sys._base_executable = '/tmp/python-dist-c4a400d8-1cbb-4b47-b667-a29eb098405d/python-archives/venv.zip/bin/python'
        sys.base_prefix = '/usr'
        sys.base_exec_prefix = '/usr'
        sys.executable = '/tmp/python-dist-c4a400d8-1cbb-4b47-b667-a29eb098405d/python-archives/venv.zip/bin/python'
        sys.prefix = '/usr'
        sys.exec_prefix = '/usr'
        sys.path = [
          '/usr/lib/python38.zip',
          '/usr/lib/python3.8',
          '/usr/lib/python3.8/lib-dynload',
        ]
      Fatal Python error: init_fs_encoding: failed to get the Python codec of the filesystem encoding
      Python runtime state: core initialized
      ModuleNotFoundError: No module named 'encodings'
      Current thread 0x00007ff9e9241740 (most recent call first):
      <no Python frame>
              at org.apache.flink.python.util.PythonEnvironmentManagerUtils.execute(PythonEnvironmentManagerUtils.java:211)
              at org.apache.flink.python.util.PythonEnvironmentManagerUtils.getSitePackagesPath(PythonEnvironmentManagerUtils.java:171)
              at org.apache.flink.python.util.PythonEnvironmentManagerUtils.pipInstallRequirements(PythonEnvironmentManagerUtils.java:99)
              at org.apache.flink.python.env.beam.ProcessPythonEnvironmentManager.installRequirements(ProcessPythonEnvironmentManager.java:231)
              at org.apache.flink.python.env.beam.ProcessPythonEnvironmentManager.lambda$open$0(ProcessPythonEnvironmentManager.java:130)
              at org.apache.flink.python.env.beam.ProcessPythonEnvironmentManager$PythonEnvResources.createResource(ProcessPythonEnvironmentManager.java:501)
              at org.apache.flink.python.env.beam.ProcessPythonEnvironmentManager$PythonEnvResources.getOrAllocateSharedResource(ProcessPythonEnvironmentManager.java:468)
              at org.apache.flink.python.env.beam.ProcessPythonEnvironmentManager.open(ProcessPythonEnvironmentManager.java:118)
              at org.apache.flink.streaming.api.runners.python.beam.BeamPythonFunctionRunner.open(BeamPythonFunctionRunner.java:214)
              at org.apache.flink.streaming.api.operators.python.AbstractPythonFunctionOperator.open(AbstractPythonFunctionOperator.java:131)
              at org.apache.flink.streaming.api.operators.python.AbstractOneInputPythonFunctionOperator.open(AbstractOneInputPythonFunctionOperator.java:116)
              at org.apache.flink.streaming.api.operators.python.PythonProcessOperator.open(PythonProcessOperator.java:59)
              at org.apache.flink.streaming.runtime.tasks.RegularOperatorChain.initializeStateAndOpenOperators(RegularOperatorChain.java:110)
              at org.apache.flink.streaming.runtime.tasks.StreamTask.restoreGates(StreamTask.java:711)
              at org.apache.flink.streaming.runtime.tasks.StreamTaskActionExecutor$SynchronizedStreamTaskActionExecutor.call(StreamTaskActionExecutor.java:100)
              at org.apache.flink.streaming.runtime.tasks.StreamTask.restoreInternal(StreamTask.java:687)
              at org.apache.flink.streaming.runtime.tasks.StreamTask.restore(StreamTask.java:654)
              at org.apache.flink.runtime.taskmanager.Task.runWithSystemExitMonitoring(Task.java:958)
              at org.apache.flink.runtime.taskmanager.Task.restoreAndInvoke(Task.java:927)
              at org.apache.flink.runtime.taskmanager.Task.doRun(Task.java:766)
              at org.apache.flink.runtime.taskmanager.Task.run(Task.java:575)
              at java.base/java.lang.Thread.run(Thread.java:829)
      org.apache.flink.client.program.ProgramAbortException: java.lang.RuntimeException: Python process exits with code: 1
              at org.apache.flink.client.python.PythonDriver.main(PythonDriver.java:140)
              at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
              at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
              at java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
              at java.base/java.lang.reflect.Method.invoke(Method.java:566)
              at org.apache.flink.client.program.PackagedProgram.callMainMethod(PackagedProgram.java:355)
              at org.apache.flink.client.program.PackagedProgram.invokeInteractiveModeForExecution(PackagedProgram.java:222)
              at org.apache.flink.client.ClientUtils.executeProgram(ClientUtils.java:114)
              at org.apache.flink.client.cli.CliFrontend.executeProgram(CliFrontend.java:812)
              at org.apache.flink.client.cli.CliFrontend.run(CliFrontend.java:246)
              at org.apache.flink.client.cli.CliFrontend.parseAndRun(CliFrontend.java:1054)
              at org.apache.flink.client.cli.CliFrontend.lambda$main$10(CliFrontend.java:1132)
              at org.apache.flink.runtime.security.contexts.NoOpSecurityContext.runSecured(NoOpSecurityContext.java:28)
              at org.apache.flink.client.cli.CliFrontend.main(CliFrontend.java:1132)
      Caused by: java.lang.RuntimeException: Python process exits with code: 1
              at org.apache.flink.client.python.PythonDriver.main(PythonDriver.java:130)
              ... 13 more
      

       

       

      Attachments

        Activity

          People

            Unassigned Unassigned
            kafkachris Kafka Chris
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated: