Spark / SPARK-13485

(Dataset-oriented) API evolution in Spark 2.0

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Blocker
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 2.0.0
    • Component/s: SQL

    Description

      As part of Spark 2.0, we want to create a stable API foundation so that Dataset can become the main user-facing API in Spark. This ticket tracks the tasks related to that goal.

      The main high-level changes are:

      1. Merge Dataset/DataFrame
      2. Create a more natural entry point for Dataset (SQLContext/HiveContext are not ideal because of the names "SQL"/"Hive", and SparkContext is not ideal because of its heavy dependency on RDDs)
      3. First-class support for sessions
      4. First-class support for a system catalog

      See the design doc for more details.
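
      The ticket itself does not name the resulting APIs. As a rough illustration, the sketch below shows how the four changes surface in the released Spark 2.0 Scala API (`SparkSession`, the `DataFrame = Dataset[Row]` alias, `newSession()`, and `spark.catalog`). It assumes Spark 2.0 or later with the spark-sql module on the classpath; the `Person` record, the application name, the view name, and the local master are hypothetical choices made only for this example, and it is not taken from the design doc.

```scala
import org.apache.spark.sql.{DataFrame, Dataset, Row, SparkSession}

object ApiEvolutionSketch {
  // Hypothetical record type, used only for illustration.
  case class Person(name: String, age: Long)

  def main(args: Array[String]): Unit = {
    // (2) One entry point whose name mentions neither "SQL" nor "Hive".
    val spark = SparkSession.builder()
      .appName("api-evolution-sketch") // hypothetical application name
      .master("local[*]")              // assumption: local mode for experimentation
      .getOrCreate()
    import spark.implicits._

    // (1) DataFrame is a type alias for Dataset[Row], so one class serves both APIs.
    val people: Dataset[Person] = Seq(Person("Ann", 34L), Person("Bob", 19L)).toDS()
    val untyped: DataFrame = people.toDF()
    val sameType: Dataset[Row] = untyped // compiles because the two types are identical

    // (3) Sessions share the SparkContext but keep their own temporary views.
    val other = spark.newSession()
    people.createOrReplaceTempView("people") // hypothetical view name

    // (4) The catalog is queried through the session, not through SQL strings.
    val inFirst = spark.catalog.listTables().collect().exists(_.name == "people")
    val inOther = other.catalog.listTables().collect().exists(_.name == "people")
    assert(inFirst, "the view is registered in the session that created it")
    assert(!inOther, "temporary views are not visible from a separate session")
    assert(sameType.count() == 2L)

    spark.stop()
  }
}
```

      The temporary view is visible only in the session that registered it, which is the isolation that first-class session support is meant to provide, while both sessions still run on the same underlying SparkContext.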

      Attachments

        1. API Evolution in Spark 2.0.pdf (772 kB, Reynold Xin)

          People

            Assignee: Reynold Xin (rxin)
            Reporter: Reynold Xin (rxin)
            Votes: 0
            Watchers: 32
