Uploaded image for project: 'Hadoop Map/Reduce'
  1. Hadoop Map/Reduce
  2. MAPREDUCE-4740

only .jars can be added to the Distributed Cache classpath

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Blocker
    • Resolution: Fixed
    • 0.23.3, 2.0.2-alpha
    • 2.0.3-alpha, 0.23.5
    • mrv2
    • None
    • Reviewed

    Description

      Koji tracked down this one.

      $ cat mycat.sh
      #!/bin/sh
      cat >& /dev/null
      $JAVA_HOME/bin/jinfo $PPID | grep java.class.path
      export | grep CLASSPATH
      ls -l 
      
      $ myfile=/user/me/myclasses.zip; yarn jar
      hadoop-streaming.jar
      -Dmapreduce.job.cache.archives=hdfs:///${myfile}
      -Dmapreduce.job.classpath.archives=${myfile} -input in.txt -output out -reducer NONE -mapper mycat.sh  -file mycat.sh
      

      So, cachearchive like class.zip or class.tar.gz were never set as part of the
      classpath even though they were properly set by DistributedCache.addArchiveToClassPath.

      It looks like we are parsing the classpath out of the configs, but then throwing that away. It looks simple enough to add them in the correct place.

      Attachments

        1. MR-4740.txt
          9 kB
          Robert Joseph Evans
        2. MR-4740.txt
          9 kB
          Robert Joseph Evans
        3. MR-4740.txt
          9 kB
          Robert Joseph Evans
        4. MR-4740.txt
          6 kB
          Robert Joseph Evans

        Issue Links

          Activity

            People

              revans2 Robert Joseph Evans
              revans2 Robert Joseph Evans
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: