Details
- Type: Bug
- Status: Resolved
- Priority: Major
- Resolution: Fixed
- 2.4.4
- None
Description
As shown in the attachment, we set spark.history.fs.numReplayThreads=30 for the Spark History Server.
However, only one replay thread is doing any work, because of a straggler.
Let's check the code:
https://github.com/apache/spark/blob/7f36cd2aa5e066a807d498b8c51645b136f08a75/core/src/main/scala/org/apache/spark/deploy/history/FsHistoryProvider.scala#L509-L547
The code waits synchronously for all replay tasks to finish, so a single slow task leaves the other threads idle.
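To illustrate the effect, here is a minimal sketch of the pattern, not the actual FsHistoryProvider code. It assumes a fixed thread pool and uses a hypothetical `replay` helper that only sleeps; the names `ReplayStragglerSketch`, `replay`, and `scan` are invented for this example. A batch of tasks is submitted and the caller then blocks on every future, so the batch takes as long as its slowest task.

```scala
import java.util.concurrent.{Callable, ExecutorService, Executors, Future, TimeUnit}

object ReplayStragglerSketch {
  // Hypothetical stand-in for replaying one event log: it just sleeps.
  def replay(millis: Long): Callable[Long] = new Callable[Long] {
    override def call(): Long = { Thread.sleep(millis); millis }
  }

  // One scan: submit the whole batch, then block until every task is done.
  // Returns the elapsed wall-clock time in milliseconds.
  def scan(pool: ExecutorService, durations: Seq[Long]): Long = {
    val start = System.nanoTime()
    val tasks: Seq[Future[Long]] = durations.map(d => pool.submit(replay(d)))
    // Synchronous wait: the caller cannot submit new work until the
    // slowest task in this batch has finished.
    tasks.foreach(_.get())
    TimeUnit.NANOSECONDS.toMillis(System.nanoTime() - start)
  }

  def main(args: Array[String]): Unit = {
    val pool = Executors.newFixedThreadPool(4)
    try {
      // Three short tasks and one straggler share four threads.
      val elapsed = scan(pool, Seq(20L, 20L, 20L, 500L))
      println(s"scan took about $elapsed ms")
    } finally {
      pool.shutdown()
    }
  }
}
```

In this sketch three of the four threads finish quickly and then sit idle until the straggler completes, which is consistent with the single busy replay thread reported above.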
Attachments
Issue Links
- is related to: SPARK-39083 Fix FsHistoryProvider race condition between update and clean app data (Resolved)
- links to