Details
- Type: Sub-task
- Status: Resolved
- Priority: Major
- Resolution: Fixed
- Affects Version/s: 3.0.0, 3.1.0
- Labels: None
Description
Dynamic scaling on Kubernetes (introduced in Spark 3) can only shut down executors that hold no shuffle files. However, Spark does not aggressively clean up shuffle files (see SPARK-5836) and instead depends on JVM GC on the driver to trigger deletes. We already have a mechanism to explicitly clean up shuffle files, used by the ALS algorithm, where we create a lot of quickly orphaned shuffle files. We should expose this as an advanced developer feature so that users can better clean up shuffle files and improve dynamic scaling of their jobs on Kubernetes.
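As a rough illustration of how such a developer API could be used, here is a minimal sketch. It assumes the method is exposed as `RDD.cleanShuffleDependencies` (the name referenced in the linked SPARK-38417); the example job and the explicit `blocking` flag are illustrative only.

```scala
import org.apache.spark.sql.SparkSession

object ShuffleCleanupSketch {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("shuffle-cleanup-sketch")
      .getOrCreate()
    val sc = spark.sparkContext

    // A shuffle-producing job: reduceByKey writes shuffle files on the executors.
    val counts = sc.parallelize(1 to 1000000)
      .map(i => (i % 100, 1L))
      .reduceByKey(_ + _)

    // Materialize and cache the result so the shuffle output is no longer needed
    // to recompute it cheaply.
    counts.persist().count()

    // Advanced developer API (experimental, see SPARK-38417): asynchronously
    // remove this RDD's shuffle dependencies so their files can be deleted and
    // the executors holding them become candidates for scale-down.
    counts.cleanShuffleDependencies(blocking = false)

    spark.stop()
  }
}
```

Under dynamic allocation with shuffle tracking (spark.dynamicAllocation.shuffleTracking.enabled), executors that no longer host live shuffle blocks can then be released without waiting for driver-side GC to trigger the cleanup.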
Issue Links
- is related to
  - SPARK-38417 Remove `Experimental` from `RDD.cleanShuffleDependencies` API (Resolved)
- relates to
  - SPARK-5836 Highlight in Spark documentation that by default Spark does not delete its temporary files (Resolved)
- links to