Uploaded image for project: 'Oozie'
  1. Oozie
  2. OOZIE-1290

Need to get rid of OOZIE-1089 workaround

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 3.3.1, 3.3.2
    • None
    • core
    • None

    Description

      This seems a combination of issues that are being exposed in 2.0.2-alpha by MAPREDUCE-4549.

      MAPREDUCE-4549 introduces a check to to ensure there are not duplicate JARs in the distributed-cache (using the JAR name as identity).

      In Hadoop 2 (different from Hadoop 1), all JARs in the distributed-cache are symlink-ed to the current directory of the task.

      MRApps, when setting up the DistributedCache (MRApps#setupDistributedCache->parseDistributedCacheArtifacts) assumes that the local resources (this includes files in the CURRENT_DIR/, CURRENT_DIR/classes/ and files in CURRENT_DIR/lib/) are part of the distributed-cache already.

      For systems, like Oozie, which use a launcher job to submit the real job this poses a problem because MRApps is run from the launcher job to submit the real job. The configuration of the real job has the correct distributed-cache entries (no duplicates), but because the current dir has the same files, the submission fails.

      It seems that MRApps should not be checking dups in the distributed-cached against JARs in the CURRENT_DIR/ or CURRENT_DIR/lib/. The dup check should be done among distributed-cached entries only.

      It seems YARNRunner is symlink-ing all files in the distributed cached in the current directory. In Hadoop 1 this was done only for files added to the distributed-cache using a fragment (ie "#FOO") to trigger a symlink creation.

      Marking as a blocker because without a fix for this, Oozie cannot submit jobs to Hadoop 2 (i've debugged Oozie in a live cluster being used by BigTop thanks Roman to test their release work, and I've verified that Oozie 3.3 does not create duplicated entries in the distributed-cache)

      Attachments

        1. mr-job.logs.txt
          124 kB
          Roman Shaposhnik
        2. mr-job.conf.xml
          72 kB
          Roman Shaposhnik
        3. launcher-job.logs.txt
          43 kB
          Roman Shaposhnik
        4. launcher-job.conf.xml
          73 kB
          Roman Shaposhnik

        Issue Links

          Activity

            People

              Unassigned Unassigned
              tucu00 Alejandro Abdelnur
              Votes:
              0 Vote for this issue
              Watchers:
              15 Start watching this issue

              Dates

                Created:
                Updated: