Pig
  1. Pig
  2. PIG-2672

Optimize the use of DistributedCache

    Details

    • Type: Improvement Improvement
    • Status: Closed
    • Priority: Major Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.13.0
    • Component/s: None
    • Labels:
      None

      Description

      Pig currently copies jar files to a temporary location in hdfs and then adds them to DistributedCache for each job launched. This is inefficient in terms of

      • Space - The jars are distributed to task trackers for every job taking up lot of local temporary space in tasktrackers.
      • Performance - The jar distribution impacts the job launch time.
      1. PIG-2672-7.patch
        11 kB
        Aniket Mokashi
      2. PIG-2672-5.patch
        43 kB
        Aniket Mokashi
      3. PIG-2672-10.patch
        12 kB
        Aniket Mokashi
      4. PIG-2672.patch
        23 kB
        Aniket Mokashi

        Issue Links

          Activity

            People

            • Assignee:
              Aniket Mokashi
              Reporter:
              Rohini Palaniswamy
            • Votes:
              0 Vote for this issue
              Watchers:
              10 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                Development