Details
-
Bug
-
Status: Resolved
-
Minor
-
Resolution: Fixed
-
2.3.0, 2.3.1
-
None
Description
https://github.com/apache/spark/pull/16924/files updates the length of the inprogress files allowing history server being responsive.
Although we have a production job that has 60000 tasks per stage and due to hsync being slow it starts dropping events and the history server has wrong stats due to events being dropped.
A viable solution is not to make it sync very frequently or make it configurable.
Attachments
Issue Links
- duplicates
-
SPARK-25645 Add provision to disable EventLoggingListener default flush/hsync/hflush for all events
- Resolved
- is caused by
-
SPARK-19531 History server doesn't refresh jobs for long-life apps like thriftserver
- Resolved
- links to