[SPARK-1064] Make it possible to use cluster's Hadoop jars when running against YARN - ASF JIRA

Details

Type: Sub-task
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 1.0.0
Component/s: YARN
Labels: None

Description

YARN applications like MapReduce and Tez rely on the cluster's Hadoop jars instead of distributing their own.

This has a couple of advantages:

- Avoids sending a bunch of bits to every node for each app
- Ensures only a single version of Hadoop is running on a cluster at one time, which simplifies debugging
- Makes it easier to upgrade Hadoop and apply patched versions
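
As a rough illustration of the idea (not Spark's actual implementation), the sketch below shows how a client could build a container classpath that puts the application's own jar first and then appends the Hadoop entries the cluster already provides. It assumes the cluster entries arrive as a comma-separated string, which is the format of YARN's yarn.application.classpath property. The helper name build_container_classpath, the ./app.jar entry, and the example property value are hypothetical and chosen only for illustration.

```python
# Hypothetical helper for illustration; this is not Spark's or YARN's API.
def build_container_classpath(app_entries, cluster_classpath, sep=":"):
    """Combine the app's own entries with the cluster-provided Hadoop entries.

    cluster_classpath is a comma-separated string, the format used by the
    yarn.application.classpath property. Entries keep their first-seen order
    and duplicates are dropped.
    """
    cluster_entries = [e.strip() for e in cluster_classpath.split(",") if e.strip()]
    seen = set()
    ordered = []
    for entry in list(app_entries) + cluster_entries:
        if entry not in seen:
            seen.add(entry)
            ordered.append(entry)
    return sep.join(ordered)


# Assumed example value; a real cluster defines this in its own configuration.
EXAMPLE_CLUSTER_CLASSPATH = (
    "$HADOOP_CONF_DIR, "
    "$HADOOP_COMMON_HOME/share/hadoop/common/*, "
    "$HADOOP_YARN_HOME/share/hadoop/yarn/*"
)

classpath = build_container_classpath(["./app.jar"], EXAMPLE_CLUSTER_CLASSPATH)
print(classpath)
```

With this approach the application ships only its own jar, and the environment variables in the remaining entries are left for each node to expand against its local Hadoop installation.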

Issue Links

is related to: SPARK-1233 spark on hadoop 0.23 yarn fails to run: java.lang.NoSuchFieldException: DEFAULT_MAPREDUCE_APPLICATION_CLASSPATH (Resolved)

People

Assignee: Sandy Ryza

Reporter: Sanford Ryza

Votes: 0

Watchers: 4

Dates

Created: 07/Feb/14 10:25

Updated: 04/Apr/14 20:50

Resolved: 12/Mar/14 00:49