Description
package org.apache.spark.sql
. . .
class Dataset[T] private[sql](
. . .
  def groupBy(col1: String, cols: String*): RelationalGroupedDataset = {
    val colNames: Seq[String] = col1 +: cols
    RelationalGroupedDataset(
      toDF(), colNames.map(colName => resolve(colName)), RelationalGroupedDataset.GroupByType)
  }
`groupBy` should call `.distinct` on `colNames` before resolving them, so that a column name passed more than once is only used once as a grouping key.
I'm not sure whether the community agrees with this, or whether it should be left to users to deduplicate the column names themselves.
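To make the proposal concrete, here is a minimal sketch that runs without Spark. The helper `groupingColumns` is hypothetical (it is not part of Spark); it only mirrors how `groupBy` builds `colNames` and applies the suggested `.distinct`, using nothing beyond `+:` and `Seq.distinct` from the Scala standard library.

```scala
object GroupByDistinctSketch {
  // Hypothetical helper, not a Spark API: builds the name list the same way
  // groupBy does (col1 +: cols) and then drops repeated names.
  def groupingColumns(col1: String, cols: String*): Seq[String] =
    (col1 +: cols).distinct

  def main(args: Array[String]): Unit = {
    // "a" is passed twice; distinct keeps the first occurrence and preserves order.
    val names = groupingColumns("a", "b", "a")
    println(names.mkString(", "))
  }
}
```

If the change is not made inside `groupBy`, a caller could presumably apply the same deduplication to the name list before passing it in.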