Hadoop Common / HADOOP-4041

IsolationRunner does not work as documented

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.18.0
    • Fix Version/s: 0.21.0
    • Component/s: documentation
    • Labels: None
    • Hadoop Flags: Reviewed
    • Release Note: Fixed a bug in IsolationRunner to make it work for map tasks.

      Description

      IsolationRunner does not work as documented in the tutorial.

      The tutorial says "To use the IsolationRunner, first set keep.failed.tasks.files to true (also see keep.tasks.files.pattern)."

      The property name in the tutorial is misspelled. It should be:
      keep.failed.task.files (singular "task", not "tasks")
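
To illustrate why the misspelling matters, here is a minimal sketch that uses java.util.Properties as a stand-in for the job configuration. It assumes the framework looks the flag up under the corrected name and treats an unset value as false; the class and method names are hypothetical and are not part of Hadoop.

```java
import java.util.Properties;

public class KeepFailedTaskFilesKey {
    // Corrected property name, per this report.
    static final String CORRECT_KEY = "keep.failed.task.files";
    // Name as printed in the tutorial (extra "s" in "tasks").
    static final String TUTORIAL_KEY = "keep.failed.tasks.files";

    // Hypothetical stand-in for a boolean config lookup:
    // reads the corrected key and defaults to false when it is unset.
    static boolean keepFailedTaskFiles(Properties conf) {
        return Boolean.parseBoolean(conf.getProperty(CORRECT_KEY, "false"));
    }

    public static void main(String[] args) {
        Properties misspelled = new Properties();
        misspelled.setProperty(TUTORIAL_KEY, "true");

        Properties corrected = new Properties();
        corrected.setProperty(CORRECT_KEY, "true");

        // The misspelled key is stored but never read, so the flag stays off.
        System.out.println(keepFailedTaskFiles(misspelled)); // prints "false"
        System.out.println(keepFailedTaskFiles(corrected));  // prints "true"
    }
}
```

Under these assumptions, a job configured with the tutorial's spelling would silently behave as if the flag had never been set.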

      After the above was set (quoted from my message on hadoop-core):
      > After the task hung, I failed it via the web interface. Then I went to
      > the node that was running this task
      >
      > $ cd ...local/taskTracker/jobcache/job_200808071645_0001/work
      > (this path is already different from the tutorial's)
      >
      > $ hadoop org.apache.hadoop.mapred.IsolationRunner ../job.xml
      > Exception in thread "main" java.lang.NullPointerException
      >     at org.apache.hadoop.mapred.IsolationRunner.main(IsolationRunner.java:164)
      >
      > Looking at IsolationRunner code, I see this:
      >
      > 164   File workDirName = new File(lDirAlloc.getLocalPathToRead(
      > 165       TaskTracker.getJobCacheSubdir()
      > 166       + Path.SEPARATOR + taskId.getJobID()
      > 167       + Path.SEPARATOR + taskId
      > 168       + Path.SEPARATOR + "work",
      > 169       conf).toString());
      >
      > I.e. it assumes there is supposed to be a taskID subdirectory under the job
      > dir, but:
      > $ pwd
      > ...mapred/local/taskTracker/jobcache/job_200808071645_0001
      > $ ls
      > jars job.xml work
      >
      > – it's not there.
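
The mismatch described in the quoted message can be sketched with plain java.nio.file paths. This is only an illustration of the two directory shapes, not IsolationRunner's code: the class and method names are hypothetical, the task ID is a made-up placeholder (the report does not give one), and a temporary directory stands in for the elided jobcache path.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;

public class WorkDirLayout {
    // Shape built by the quoted IsolationRunner lines:
    // <jobcache>/<jobId>/<taskId>/work
    static Path expectedByIsolationRunner(Path jobCache, String jobId, String taskId) {
        return jobCache.resolve(jobId).resolve(taskId).resolve("work");
    }

    // Shape observed on the node in the report:
    // <jobcache>/<jobId>/work
    static Path observedOnNode(Path jobCache, String jobId) {
        return jobCache.resolve(jobId).resolve("work");
    }

    public static void main(String[] args) throws IOException {
        // Temporary directory standing in for the real jobcache directory.
        Path jobCache = Files.createTempDirectory("jobcache");
        String jobId = "job_200808071645_0001";
        String taskId = "task_placeholder"; // hypothetical; not from the report

        // Recreate only what the report's "ls" output shows: a work directory
        // directly under the job directory.
        Files.createDirectories(observedOnNode(jobCache, jobId));

        Path expected = expectedByIsolationRunner(jobCache, jobId, taskId);
        Path observed = observedOnNode(jobCache, jobId);

        System.out.println(Files.isDirectory(expected)); // prints "false"
        System.out.println(Files.isDirectory(observed)); // prints "true"
    }
}
```

With only the observed layout on disk, the path that includes the task ID subdirectory does not resolve to an existing directory, which is consistent with the lookup on line 164 failing.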

        Attachments

        1. hadoop-4041.patch (10 kB, Tom White)
        2. HADOOP-4041-v2.patch (39 kB, Philip Zeyliger)
        3. HADOOP-4041-v3.patch (38 kB, Philip Zeyliger)
        4. HADOOP-4041-v4.patch (39 kB, Philip Zeyliger)
        5. HADOOP-4041-v4-y20.patch (39 kB, Hemanth Yamijala)
        6. org.apache.hadoop.fs.LocalDirAllocator.html (6 kB, Philip Zeyliger)

              People

              • Assignee: Philip Zeyliger (philip)
              • Reporter: Yuri Pradkin (yurip)
              • Votes: 0
              • Watchers: 6