Details
-
Improvement
-
Status: Open
-
Major
-
Resolution: Unresolved
-
3.3.0
-
None
-
None
Description
Currently, Spark supports push down Filters and Aggregates to data source.
However, the Data Source V2 operator pushdown framework has the following shortcomings:
- Only simple filter and aggregate are supported, which makes it impossible to apply in most scenarios
- The incompatibility of SQL syntax makes it impossible to apply in most scenarios
- Aggregate push down does not support multiple partitions of data sources
- Spark's additional aggregate will cause some overhead
- Limit push down is not supported
- Top n push down is not supported
- Aggregate push down does not support group by expressions
- Aggregate push down does not support not use aggregate functions
- Offset push down is not supported
- Paging push down is not supported
- UDF/UDAF push down is not supported
Attachments
Issue Links
- duplicates
-
SPARK-36695 Allow passing V2 functions to data sources via V2 filters
- Resolved