Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-7855

Move hash-style shuffle code out of ExternalSorter and into own file

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 1.5.0
    • Shuffle, Spark Core
    • None

    Description

      ExternalSorter contains a bunch of code for handling the bypassMergeThreshold / hash-style shuffle path. I think that it would significantly simplify the code to move this functionality out of ExternalSorter and into a separate class which shares a common interface (insertAll / writePartitionedFile()). This is a stepping-stone towards eventually removing this bypass path (see SPARK-6026)

      Attachments

        Issue Links

          Activity

            People

              joshrosen Josh Rosen
              joshrosen Josh Rosen
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: