SPARK-13457: Remove DataFrame RDD operations

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.0.0
    • Fix Version/s: 2.0.0
    • Component/s: SQL
    • Labels: None

    Description

      We'd like to remove DataFrame RDD operations such as map, filter, and foreach because:

      1. After making DataFrame a subclass of Dataset[Row], these methods conflict with the methods of the same name in Dataset.
      2. They return RDDs, which is semantically improper for methods defined on a Dataset.

      Removing them is trivial, since they simply delegate to the corresponding methods of DataFrame.rdd.
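
For call sites that relied on the removed methods, the change can be sketched roughly as follows. This is an illustrative sketch and not code from the patch: the object name, the helper names, and the sample data are assumptions, and it presumes Spark 2.0 or later on the classpath. It shows the explicit route through DataFrame.rdd next to the Dataset version of map, which returns a Dataset and needs an Encoder in scope.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}

// Hypothetical names throughout; only the Spark calls themselves are real API.
object RddOpsMigrationSketch {

  // Explicit RDD route: what the removed DataFrame.map delegated to.
  def doubledViaRdd(df: DataFrame): Array[Long] =
    df.rdd.map(row => row.getLong(0) * 2).collect()

  // Dataset route: map now returns a Dataset[Long], so an Encoder[Long]
  // must be available; spark.implicits._ provides one.
  def doubledViaDataset(spark: SparkSession, df: DataFrame): Array[Long] = {
    import spark.implicits._
    df.map(row => row.getLong(0) * 2).collect()
  }

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .master("local[1]")
      .appName("rdd-ops-migration-sketch")
      .getOrCreate()
    try {
      // A single LongType column named "id", used only as sample input.
      val df = spark.range(1, 4).toDF("id")

      println(doubledViaRdd(df).mkString(", "))
      println(doubledViaDataset(spark, df).mkString(", "))

      // filter and foreach follow the same pattern through df.rdd.
      df.rdd.filter(row => row.getLong(0) > 1).foreach(row => println(row))
    } finally {
      spark.stop()
    }
  }
}
```

Code that genuinely needs an RDD can keep calling df.rdd explicitly, which makes the switch from Dataset to RDD visible at the call site.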

            People

              Assignee: Cheng Lian
              Reporter: Cheng Lian
              Votes: 0
              Watchers: 4
