Details
- Improvement
- Status: Resolved
- Major
- Resolution: Fixed
- 1.4.0
- None
Description
Right now there are many conditions in which history files get left around with the .inprogress extension. The cleaner doesn't remove these because it can't distinguish between an application that is still running and leftover files from an abandoned one.
In some cases it's really hard to avoid leaving these around (the application is killed, crashes, etc.), but we should still try to handle the history files better.
For instance, we could first write them to the .sparkStaging directory and then move them to the history file location before the application finishes. That way only completed history files ever appear in the history location.
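The staging idea can be illustrated with a minimal sketch. This is not Spark's implementation: it uses the local filesystem instead of HDFS, and the function name, directory arguments, and event format are all hypothetical. It only shows the pattern of writing to a staging location and moving the finished file into place in a single step.

```python
import os


def write_history_via_staging(staging_dir, history_dir, app_id, events):
    """Write events to a staging file, then move it into history_dir.

    Hypothetical helper for illustration only; names and layout are
    assumptions, not Spark's actual event-log code.
    """
    os.makedirs(staging_dir, exist_ok=True)
    os.makedirs(history_dir, exist_ok=True)

    staged_path = os.path.join(staging_dir, app_id)
    final_path = os.path.join(history_dir, app_id)

    # All writes go to the staging file. If the process dies here,
    # nothing partial ever shows up in history_dir.
    with open(staged_path, "w", encoding="utf-8") as f:
        for event in events:
            f.write(event + "\n")
        f.flush()
        os.fsync(f.fileno())

    # os.replace is atomic when both paths are on the same filesystem,
    # so readers of history_dir see either no file or the complete file.
    os.replace(staged_path, final_path)
    return final_path
```

With this layout a crash leaves the partial file in the staging directory, which can be cleaned up independently, and the history location needs no .inprogress marker. The trade-off is that a running application is not visible in the history location until it finishes.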
Issue Links
- is duplicated by
  - SPARK-18733 Spark history server file cleaner excludes in-progress files (Resolved)
  - SPARK-17119 Add configuration property to allow the history server to delete .inprogress files (Resolved)
- links to