Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-9457 Sorting improvements
  3. SPARK-7078

Cache-aware binary processing in-memory sort

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 1.5.0
    • Shuffle, Spark Core
    • None

    Description

      A cache-friendly sort algorithm that can be used eventually for:

      • sort-merge join
      • shuffle

      See the old alpha sort paper: http://research.microsoft.com/pubs/68249/alphasort.doc

      Note that state-of-the-art for sorting has improved quite a bit, but we can easily optimize the sorting algorithm itself later.

      Attachments

        Issue Links

          Activity

            People

              joshrosen Josh Rosen
              rxin Reynold Xin
              Votes:
              0 Vote for this issue
              Watchers:
              10 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: