Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-12235

Enhance mutate() to support replace existing columns

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 1.5.2
    • 2.0.0
    • SparkR
    • None

    Description

      mutate() in the dplyr package supports adding new columns and replacing existing columns. But currently the implementation of mutate() in SparkR supports adding new columns only.

      Also make the behavior of mutate more consistent with that in dplyr.
      1. Throw error message when there are duplicated column names in the DataFrame being mutated.
      2. when there are duplicated column names in specified columns by arguments, the last column of the same name takes effect.

      Attachments

        Issue Links

          Activity

            People

              sunrui Sun Rui
              sunrui Sun Rui
              Votes:
              0 Vote for this issue
              Watchers:
              6 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: