Details
-
New Feature
-
Status: Resolved
-
Major
-
Resolution: Auto Closed
-
1.1.0, 1.2.0
-
None
Description
Now that spark has a sort based shuffle, can we expect a secondary sort soon? There are some use cases where getting a sorted iterator of values per key is helpful.
Attachments
Issue Links
- is related to
-
SPARK-2045 Sort-based shuffle implementation
-
- Resolved
-
-
SPARK-10405 Support takeOrdered and topK values per key
-
- Resolved
-
- relates to
-
PIG-4504 Enable Secondary key sort feature in spark mode
-
- Closed
-
-
SPARK-15798 Secondary sort in Dataset/DataFrame
-
- Resolved
-
- links to