Description
We can push down a scalar subquery filter such as a < (select max(d) from t2) to FileSourceScan. For example:
sql("CREATE TABLE t1 using parquet AS SELECT id AS a, id AS b FROM range(500000000L)")
sql("CREATE TABLE t2 using parquet AS SELECT id AS d FROM range(20)")
sql("SELECT * FROM t1 WHERE b = (select max(d) from t2)").show
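One way to check whether the subquery filter reaches the scan is to inspect the query plan. The following is a minimal sketch for spark-shell, assuming the predefined `spark` session and that tables t1 and t2 from the statements above already exist. The substrings it searches for ("FileScan", "Filters") are an assumption about how the scan node is printed and may vary between Spark versions.

```scala
// Sketch for spark-shell, where `spark` is the predefined SparkSession.
// Assumes tables t1 and t2 were created by the statements above.
val df = spark.sql("SELECT * FROM t1 WHERE b = (select max(d) from t2)")

// Print the parsed, analyzed, optimized and physical plans.
df.explain(true)

// Alternatively, print only the physical plan lines that mention the
// file scan or its filters (assumed substrings, see the note above).
df.queryExecution.executedPlan.toString
  .split("\n")
  .filter(line => line.contains("FileScan") || line.contains("Filters"))
  .foreach(println)
```

If the filter is pushed down, the scan node would be expected to reference the scalar subquery in its filter list rather than only in a separate Filter node above the scan.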
Issue Links
- is fixed by SPARK-43402: FileSourceScanExec supports push down data filter with scalar subquery (Resolved)