Details
-
Improvement
-
Status: Resolved
-
Minor
-
Resolution: Won't Fix
-
None
-
None
Description
Spark provides the top() and takeOrdered() APIs that return "top" or "bottom" items from a given RDD.
It'd be good to have an API that returned the "top" values per key for a keyed RDD i.e. RDDpair. Such an API would be very useful for cases where the task is to only display an ordered subset of the input data.
Attachments
Issue Links
- relates to
-
SPARK-3655 Support sorting of values in addition to keys (i.e. secondary sort)
- Resolved