Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-27726

Performance of InMemoryStore suffers under load

    XMLWordPrintableJSON

Details

    • Umbrella
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 2.3.3
    • 2.4.4, 3.0.0
    • Spark Core
    • None

    Description

      When our Spark system has been under load for an extended period of time, GC remains highly active and the jobs page becomes unresponsive even when load eases.  Please see the attached GCRateIssues for more details regarding the problem definition.

      We found a number of separate issues which are detailed in the subtasks.  I anticipate committing a single PR for all subtasks whose commits roughly align with the descriptions in the subtasks.

      The performance of the code is measured before and after the change and is attached in the document PerformanceBeforeAndAfter.  tl;dr in our use case, we saw about five (computed) orders of magnitude improvement.

      Attachments

        1. GCRateIssues.pdf
          173 kB
          David C Navas
        2. PerformanceBeforeAndAfter.pdf
          167 kB
          David C Navas

        Issue Links

          Activity

            People

              davidnavas David C Navas
              davidnavas David C Navas
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: