Hadoop Map/Reduce
  1. Hadoop Map/Reduce
  2. MAPREDUCE-1505

Cluster class should create the rpc client only when needed

    Details

    • Type: Bug Bug
    • Status: Closed
    • Priority: Major Major
    • Resolution: Fixed
    • Affects Version/s: 0.20.2
    • Fix Version/s: 0.22.0
    • Component/s: client
    • Labels:
      None
    • Hadoop Flags:
      Reviewed
    • Release Note:
      Lazily construct a connection to the JobTracker from the job-submission client.

      Description

      It will be good to have the org.apache.hadoop.mapreduce.Cluster create the rpc client object only when needed (when a call to the jobtracker is actually required). org.apache.hadoop.mapreduce.Job constructs the Cluster object internally and in many cases the application that created the Job object really wants to look at the configuration only. It'd help to not have these connections to the jobtracker especially when Job is used in the tasks (for e.g., Pig calls mapreduce.FileInputFormat.setInputPath in the tasks and that requires a Job object to be passed).

      In Hadoop 20, the Job object internally creates the JobClient object, and the same argument applies there too.

      1. MAPREDUCE-1505_yhadoop20_9.patch
        3 kB
        Arun C Murthy
      2. MAPREDUCE-1505_yhadoop20.patch
        3 kB
        Arun C Murthy
      3. mapreduce-1505--2010-05-19.patch
        11 kB
        Dick King
      4. mapreduce-1505--2010-05-26.patch
        33 kB
        Dick King

        Issue Links

          Activity

          No work has yet been logged on this issue.

            People

            • Assignee:
              Dick King
              Reporter:
              Devaraj Das
            • Votes:
              0 Vote for this issue
              Watchers:
              8 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                Development