Details
-
Bug
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
4.0.0
Description
The following code fails in the last step with Spark Connect.
The root cause is that Connect server triggers physical plan on a streaming Dataframe in SparkConnectPlanner.scala. Better to avoid that entirely, but at least for streaming it should be avoided since it cannot be done with a batch execution engine.
df = spark.readStream.format("rate").option("numPartitions", "1").load() df.createOrReplaceTempView("temp_view") view_df = spark.sql("SELECT * FROM temp_view") // FAILS