Hadoop Map/Reduce / MAPREDUCE-4740

only .jars can be added to the Distributed Cache classpath


Details

    • Type: Bug
    • Status: Closed
    • Priority: Blocker
    • Resolution: Fixed
    • Affects Version/s: 0.23.3, 2.0.2-alpha
    • Fix Version/s: 2.0.3-alpha, 0.23.5
    • Component/s: mrv2
    • Labels: None
    • Hadoop Flags: Reviewed

    Description

      Koji tracked down this one.

      $ cat mycat.sh
      #!/bin/sh
      cat >& /dev/null
      $JAVA_HOME/bin/jinfo $PPID | grep java.class.path
      export | grep CLASSPATH
      ls -l 
      
      $ myfile=/user/me/myclasses.zip; yarn jar hadoop-streaming.jar \
          -Dmapreduce.job.cache.archives=hdfs:///${myfile} \
          -Dmapreduce.job.classpath.archives=${myfile} \
          -input in.txt -output out -reducer NONE -mapper mycat.sh -file mycat.sh
      

      So cache archives such as class.zip or class.tar.gz were never added to the
      classpath, even though they were properly registered with DistributedCache.addArchiveToClassPath.

      It looks like we parse the classpath entries out of the configs but then throw them away. It should be simple enough to add them in the correct place.
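
      As a rough illustration of the idea only (this is not the attached patch and not Hadoop code), the shell sketch below takes a comma-separated archive list, as found in mapreduce.job.classpath.archives, and appends every entry to a classpath string rather than discarding the parsed list. The function name append_archives_to_classpath, the /work directory, and the rule that each archive is localized under its base name are all assumptions made for the example.

```shell
#!/bin/sh
# Hypothetical helper, not Hadoop code: append each configured cache
# archive to a classpath string instead of dropping the parsed list.
#   $1: existing classpath (may be empty)
#   $2: comma-separated archive paths (as in mapreduce.job.classpath.archives)
#   $3: assumed directory where the archives are localized
append_archives_to_classpath() {
    classpath=$1
    old_ifs=$IFS
    IFS=,
    for archive in $2; do
        # Assumption: each archive is localized under its base name.
        entry="$3/$(basename "$archive")"
        # Insert a ':' separator only when the classpath is non-empty.
        classpath="${classpath:+$classpath:}$entry"
    done
    IFS=$old_ifs
    printf '%s\n' "$classpath"
}

# prints job.jar:/work/myclasses.zip
append_archives_to_classpath "job.jar" "/user/me/myclasses.zip" "/work"
```

      Running mycat.sh from the reproduction above against a fixed build would then be one way to confirm that the archive actually shows up in java.class.path.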

      Attachments

        1. MR-4740.txt
          9 kB
          Robert Joseph Evans
        2. MR-4740.txt
          9 kB
          Robert Joseph Evans
        3. MR-4740.txt
          9 kB
          Robert Joseph Evans
        4. MR-4740.txt
          6 kB
          Robert Joseph Evans

          People

            Assignee: Robert Joseph Evans (revans2)
            Reporter: Robert Joseph Evans (revans2)
            Votes: 0
            Watchers: 5
