Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
-
None
Description
ExternalSorter contains a bunch of code for handling the bypassMergeThreshold / hash-style shuffle path. I think that it would significantly simplify the code to move this functionality out of ExternalSorter and into a separate class which shares a common interface (insertAll / writePartitionedFile()). This is a stepping-stone towards eventually removing this bypass path (see SPARK-6026)
Attachments
Issue Links
- blocks
-
SPARK-6026 Eliminate the bypassMergeThreshold parameter and associated hash-ish shuffle within the Sort shuffle code
- Resolved
- links to