Uploaded image for project: 'Hadoop Map/Reduce'
  1. Hadoop Map/Reduce
  2. MAPREDUCE-2605

Hadoop Streaming (StreamJob) does not delete temporary job/package jar

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Minor
    • Resolution: Not A Problem
    • 0.20.2
    • 0.22.0
    • None
    • None

    Description

      The streaming job driver (org.apache.hadoop.streaming.StreamJob) does not delete the temporary JAR file it generates after a job completes. Without the fix, /var/tmp fills up with streaming job jars until they get wiped.

      The jar name is stored in the class variable 'jar_'. The JAR is generated in 'packageJobJar()' and the name stored in jar_.

      Fix: run()/submitAndMonitorJob() should clean up the jar_ file when done. Or the JAR could be generatd as a tempfile and cleaned up automatically.

      Attachments

        Activity

          People

            Unassigned Unassigned
            gwittel Greg Wittel
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Estimated:
                Original Estimate - 5m
                5m
                Remaining:
                Remaining Estimate - 5m
                5m
                Logged:
                Time Spent - Not Specified
                Not Specified