Details
- Type: Improvement
- Status: Resolved
- Priority: Major
- Resolution: Won't Fix
- Affects Version/s: 1.1.0
- Fix Version/s: None
- Component/s: None
Description
Currently, Spark uses the "java.io.tmpdir" system property to locate the /tmp/ directory.
The /tmp/ directory is then used to
1. set up the HTTP file server
2. hold broadcast data
3. fetch dependency files or jars on executors
As a result, the /tmp/ directory keeps growing, and free space on the system disk shrinks.
I think we could add a configuration "spark.tmp.dir" in conf/spark-env.sh or conf/spark-defaults.conf to set this directory explicitly. For example, it could be pointed at a data disk.
If "spark.tmp.dir" is not set, fall back to the default "java.io.tmpdir".
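The proposed fallback logic could be sketched as follows. This is an illustration only, not Spark code: the `TmpDirResolver` class and the `conf` map standing in for Spark's configuration are hypothetical, and the key name "spark.tmp.dir" is the one proposed above.

```java
import java.util.Map;

public class TmpDirResolver {
    // Prefer the user-set "spark.tmp.dir" if present; otherwise fall
    // back to the JVM's "java.io.tmpdir" system property.
    static String resolveTmpDir(Map<String, String> conf) {
        String configured = conf.get("spark.tmp.dir");
        return configured != null ? configured : System.getProperty("java.io.tmpdir");
    }

    public static void main(String[] args) {
        // Explicitly configured: points at a data disk (hypothetical path).
        System.out.println(resolveTmpDir(Map.of("spark.tmp.dir", "/data1/spark-tmp")));
        // Not configured: falls back to the JVM default, e.g. /tmp on Linux.
        System.out.println(resolveTmpDir(Map.of()));
    }
}
```

With this lookup order, existing deployments that never set "spark.tmp.dir" keep the current behavior unchanged.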