PIG-4054: Do not create job.jar when submitting job

Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.14.0
    • Component/s: impl
    • Labels: None
    • Hadoop Flags: Reviewed

    Description

      Currently Pig creates a job.jar for each job when submitting a MapReduce job. This has several disadvantages:
      1. job.jar differs from job to job, so it is never reused even when the jar cache is used (PIG-2672).
      2. Before job submission, we need to pack job.jar, which is mostly a repacking of existing jars. This is a waste of time.
      3. job.jar is an uber jar, which makes debugging harder and can lead to jar conflicts (e.g., PIG-3039).

      On the Tez side the situation is similar, but the consequence is worse since containers will not be reused.

      So instead of job.jar, I would like to ship individual jars to the distributed cache. Note that this issue is in essence independent of PIG-4047; however, PIG-4047 would make the picture more complete in that we would not have any uber jars.
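
      To illustrate the cache-reuse argument, here is a minimal sketch. It is not Pig's implementation and does not use the Hadoop distributed cache API. It assumes a content-addressed cache (entries keyed by a SHA-256 digest of the jar bytes), which may differ from how the jar cache in PIG-2672 actually keys its entries. The class JarCacheSketch, its methods, and the jar contents are all hypothetical stand-ins.

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Base64;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical model of a jar cache; not Pig or Hadoop code.
public class JarCacheSketch {
    private final Set<String> cachedDigests = new HashSet<>();
    private long bytesUploaded = 0;

    // Assumption: the cache identifies a jar by the digest of its content.
    static String sha256(byte[] content) {
        try {
            MessageDigest md = MessageDigest.getInstance("SHA-256");
            return Base64.getEncoder().encodeToString(md.digest(content));
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException(e);
        }
    }

    // Upload a jar only if identical content has not been shipped before.
    void ship(byte[] jar) {
        if (cachedDigests.add(sha256(jar))) {
            bytesUploaded += jar.length;
        }
    }

    // Three made-up dependency jars of 1000 bytes each, identical for every job.
    static List<byte[]> sharedJars() {
        List<byte[]> jars = new ArrayList<>();
        for (int i = 1; i <= 3; i++) {
            byte[] jar = new byte[1000];
            Arrays.fill(jar, (byte) i);
            jars.add(jar);
        }
        return jars;
    }

    // A small made-up jar that differs per job (for example, user code).
    static byte[] jobSpecificJar(int job) {
        return ("udf-for-job-" + job).getBytes(StandardCharsets.UTF_8);
    }

    // Old approach: repack everything into one uber jar per job, then ship it.
    public static long bytesWithUberJar(int jobs) {
        JarCacheSketch cache = new JarCacheSketch();
        for (int job = 0; job < jobs; job++) {
            ByteArrayOutputStream uber = new ByteArrayOutputStream();
            for (byte[] jar : sharedJars()) {
                uber.write(jar, 0, jar.length);
            }
            byte[] own = jobSpecificJar(job);
            uber.write(own, 0, own.length);
            cache.ship(uber.toByteArray()); // content differs per job: always a miss
        }
        return cache.bytesUploaded;
    }

    // Proposed approach: ship each jar on its own so shared jars hit the cache.
    public static long bytesWithIndividualJars(int jobs) {
        JarCacheSketch cache = new JarCacheSketch();
        for (int job = 0; job < jobs; job++) {
            for (byte[] jar : sharedJars()) {
                cache.ship(jar); // a miss for the first job only
            }
            cache.ship(jobSpecificJar(job));
        }
        return cache.bytesUploaded;
    }

    public static void main(String[] args) {
        System.out.println("uber jar:        " + bytesWithUberJar(3) + " bytes uploaded");
        System.out.println("individual jars: " + bytesWithIndividualJars(3) + " bytes uploaded");
    }
}
```

      In this toy model the uber jar is re-uploaded in full for every job because its content never matches a cached entry, while the individually shipped shared jars are uploaded once and reused afterwards.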

      Attachments

        1. PIG-4054-5.patch
          2 kB
          Daniel Dai
        2. PIG-4054-4.patch
          2 kB
          Daniel Dai
        3. PIG-4054-3.patch
          34 kB
          Daniel Dai
        4. PIG-4054-2.patch
          32 kB
          Daniel Dai
        5. PIG-4054-1.patch
          32 kB
          Daniel Dai

            People

              Assignee: Daniel Dai (daijy)
              Reporter: Daniel Dai (daijy)