Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-29043

[History Server]Only one replay thread of FsHistoryProvider work because of straggler

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 2.4.4
    • 3.0.0
    • Spark Core
    • None

    Description

      As shown in the attachment, we set spark.history.fs.numReplayThreads=30 for spark history server.
      However, there is only one replay thread work because of straggler.

      Let's check the code.
      https://github.com/apache/spark/blob/7f36cd2aa5e066a807d498b8c51645b136f08a75/core/src/main/scala/org/apache/spark/deploy/history/FsHistoryProvider.scala#L509-L547

      There is a synchronous operation for all replay tasks.

      Attachments

        1. image-2019-09-11-15-09-22-912.png
          278 kB
          feiwang
        2. image-2019-09-11-15-10-25-326.png
          435 kB
          feiwang
        3. screenshot-1.png
          209 kB
          feiwang

        Issue Links

          Activity

            People

              hzfeiwang feiwang
              hzfeiwang feiwang
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: