[MAPREDUCE-1505] Cluster class should create the rpc client only when needed - ASF JIRA

Details

Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: 0.20.2
Fix Version/s: 0.22.0
Component/s: client
Labels: None
Hadoop Flags: Reviewed
Release Note: Lazily construct a connection to the JobTracker from the job-submission client.

Description

It would be good to have org.apache.hadoop.mapreduce.Cluster create the RPC client object only when needed, that is, when a call to the JobTracker is actually required. org.apache.hadoop.mapreduce.Job constructs the Cluster object internally, and in many cases the application that created the Job object only wants to look at the configuration. Avoiding these connections to the JobTracker would help, especially when Job is used in the tasks (e.g., Pig calls mapreduce.FileInputFormat.setInputPath in the tasks, and that requires a Job object to be passed).

In Hadoop 0.20, the Job object internally creates the JobClient object, and the same argument applies there too.
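The request above amounts to lazy initialization of the RPC client. A minimal sketch of that pattern follows, not the actual patch: the classes `LazyClusterSketch`, `Cluster`, and `JobTrackerClient`, the `getJobStatus` method, and the address `jobtracker.example:8021` are all hypothetical stand-ins and not Hadoop's API. The connection is created on the first remote call rather than in the constructor, so code that only reads configuration never connects.

```java
import java.util.concurrent.atomic.AtomicInteger;
import java.util.function.Supplier;

/** Sketch of lazy RPC-client creation. All names are hypothetical, not Hadoop's. */
public final class LazyClusterSketch {

    /** Stand-in for the RPC proxy to the JobTracker. */
    interface JobTrackerClient {
        String getJobStatus(String jobId);
    }

    /** Stand-in for Cluster: holds configuration, connects only on first remote call. */
    static final class Cluster {
        private final String jobTrackerAddress;
        private final Supplier<JobTrackerClient> connector;
        private JobTrackerClient client; // stays null until a remote call is made

        Cluster(String jobTrackerAddress, Supplier<JobTrackerClient> connector) {
            // Note: no connection is opened here.
            this.jobTrackerAddress = jobTrackerAddress;
            this.connector = connector;
        }

        /** Configuration-only access: never opens a connection. */
        String getJobTrackerAddress() {
            return jobTrackerAddress;
        }

        /** Creates the client on first use and reuses it afterwards. */
        private synchronized JobTrackerClient client() {
            if (client == null) {
                client = connector.get();
            }
            return client;
        }

        /** A remote call: this is what triggers the connection. */
        String getJobStatus(String jobId) {
            return client().getJobStatus(jobId);
        }
    }

    public static void main(String[] args) {
        AtomicInteger connections = new AtomicInteger();
        Cluster cluster = new Cluster("jobtracker.example:8021", () -> {
            connections.incrementAndGet();   // count how often we "connect"
            return jobId -> "RUNNING";       // fake client, always reports RUNNING
        });

        // Reading configuration does not connect.
        System.out.println(cluster.getJobTrackerAddress() + " connections=" + connections.get());
        // The first remote call connects; later calls reuse the client.
        System.out.println(cluster.getJobStatus("job_1") + " connections=" + connections.get());
        cluster.getJobStatus("job_2");
        System.out.println("connections=" + connections.get());
    }
}
```

In this sketch a task that builds the object only to read its configuration pays no connection cost, which is the situation described for Job objects created inside tasks.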

Attachments

MAPREDUCE-1505_yhadoop20_9.patch | 07/Apr/10 06:40 | 3 kB | Arun Murthy
MAPREDUCE-1505_yhadoop20.patch | 22/Feb/10 21:32 | 3 kB | Arun Murthy
mapreduce-1505--2010-05-19.patch | 19/May/10 18:13 | 11 kB | Dick King
mapreduce-1505--2010-05-26.patch | 27/May/10 17:38 | 33 kB | Dick King

Issue Links

is depended upon by: MAPREDUCE-118 Job.getJobID() will always return null (Closed)

People

Assignee: Dick King
Reporter: Devaraj Das
Votes: 0
Watchers: 8

Dates

Created: 18/Feb/10 21:56
Updated: 12/Dec/11 06:19
Resolved: 06/Jun/10 01:28