Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-12337

Implement dropDuplicates() method of DataFrame in SparkR

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 1.5.2
    • 2.0.0
    • SparkR
    • None

    Description

      distinct() and unique() drop duplicated rows on all columns. While dropDuplicates() can drop duplicated rows on selected columns.

      Attachments

        Issue Links

          Activity

            People

              sunrui Sun Rui
              sunrui Sun Rui
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: