Description
We'd like to remove DataFrame RDD operations like map, filter, and foreach because:
- After making DataFrame a subclass of Dataset[Row], these methods conflicts with methods in Dataset.
- By returning RDDs, they are semantically improper.
It's trivial to remove them since they simply delegates to methods of DataFrame.rdd.
Attachments
Issue Links
- blocks
-
SPARK-13244 Unify DataFrame and Dataset API
- Resolved
- links to