Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-10815

Public API: Streaming Sources and Sinks

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Resolved
    • Critical
    • Resolution: Incomplete
    • None
    • None
    • Structured Streaming

    Description

      The existing (in 2.0) source/sink interface for structured streaming depends on RDDs. This dependency has two issues:

      1. The RDD interface is wide and difficult to stabilize across versions. This is similar to point 1 in https://issues.apache.org/jira/browse/SPARK-15689. Ideally, a source/sink implementation created for Spark 2.x should work in Spark 10.x, assuming the JVM is still around.

      2. It is difficult to swap in/out a different execution engine.

      The purpose of this ticket is to create a stable interface that addresses the above two.

      Attachments

        Activity

          People

            Unassigned Unassigned
            rxin Reynold Xin
            Votes:
            4 Vote for this issue
            Watchers:
            22 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: