Details
Type: New Feature
Status: Resolved
Priority: Critical
Resolution: Incomplete
Description
The existing (in 2.0) source/sink interface for structured streaming depends on RDDs. This dependency has two issues:
1. The RDD interface is wide and difficult to stabilize across versions. This is similar to point 1 in https://issues.apache.org/jira/browse/SPARK-15689. Ideally, a source/sink implementation created for Spark 2.x should work in Spark 10.x, assuming the JVM is still around.
2. Because sources and sinks are tied to RDDs, it is difficult to swap in/out a different execution engine.
The purpose of this ticket is to create a stable interface that addresses both of the issues above.
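
To make the goal concrete, here is a minimal sketch of what a source/sink contract with no RDD in its signatures might look like. It is not Spark's API and not a proposal from this ticket: every name in it (`RecordSource`, `RecordSink`, `InMemorySource`, `CollectingSink`, `runBatch`) is hypothetical, and the offset model (a single `long`, start inclusive, end exclusive) is an assumption chosen to keep the example small. The point is only that the interface surface is a few methods over plain JDK types, so any engine that can drive an `Iterator` could call it.

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;

// Illustrative only: all types here are hypothetical and do not exist in Spark.
final class StableSourceSinkSketch {

    /** Narrow source contract: offsets in, plain records out, no RDD types. */
    interface RecordSource<T> {
        /** Offset just past the last available record. */
        long latestOffset();

        /** Records in the range [startInclusive, endExclusive). */
        Iterator<T> read(long startInclusive, long endExclusive);
    }

    /** Narrow sink contract: receives one batch of plain records. */
    interface RecordSink<T> {
        void write(long batchId, Iterator<T> records);
    }

    /** Toy source backed by an in-memory list; the list index is the offset. */
    static final class InMemorySource implements RecordSource<String> {
        private final List<String> log = new ArrayList<>();

        void append(String record) {
            log.add(record);
        }

        @Override
        public long latestOffset() {
            return log.size();
        }

        @Override
        public Iterator<String> read(long startInclusive, long endExclusive) {
            // Copy the slice so later appends cannot invalidate the iterator.
            return new ArrayList<>(log.subList((int) startInclusive, (int) endExclusive)).iterator();
        }
    }

    /** Toy sink that remembers everything written to it. */
    static final class CollectingSink implements RecordSink<String> {
        private final List<String> received = new ArrayList<>();

        @Override
        public void write(long batchId, Iterator<String> records) {
            records.forEachRemaining(received::add);
        }

        List<String> received() {
            return received;
        }
    }

    /**
     * Stand-in for "the engine": moves one batch from source to sink and
     * returns the offset to resume from. It depends only on the two interfaces.
     */
    static <T> long runBatch(RecordSource<T> source, RecordSink<T> sink, long fromOffset, long batchId) {
        long end = source.latestOffset();
        if (end > fromOffset) {
            sink.write(batchId, source.read(fromOffset, end));
        }
        return end;
    }

    public static void main(String[] args) {
        InMemorySource source = new InMemorySource();
        CollectingSink sink = new CollectingSink();

        source.append("a");
        source.append("b");
        long next = runBatch(source, sink, 0L, 0L);

        source.append("c");
        next = runBatch(source, sink, next, 1L);

        System.out.println(sink.received() + " next offset = " + next);
    }
}
```

In this sketch `runBatch` plays the role of the execution engine. Since it touches only `RecordSource` and `RecordSink`, replacing it with a different driver would not require changes to either implementation, which is the property the two issues above are asking for.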