  1. Hadoop Map/Reduce
  2. MAPREDUCE-5567 [Umbrella] Stabilize MR framework w.r.t ResourceManager restart
  3. MAPREDUCE-5466

Historyserver does not refresh the result of restarted jobs after RM restart

Details

    • Type: Sub-task
    • Status: Closed
    • Priority: Blocker
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 2.1.1-beta
    • Component/s: None
    • Labels: None
    • Hadoop Flags: Reviewed

    Description

      Restart the RM while a sort job is running, and verify that the job completes successfully after the RM restarts.

      Once the job has finished successfully, run the job status command for the sort job. It shows "Job state: FAILED", even though the client log line reports FinalApplicationStatus=SUCCEEDED. The job history server does not update the result for a job that was restarted after an RM restart.

      hadoop job -status job_1375923346354_0003
      13/08/08 01:24:13 INFO mapred.ClientServiceDelegate: Application state is completed. FinalApplicationStatus=SUCCEEDED. Redirecting to job history server

      Job: job_1375923346354_0003
      Job File: hdfs://host1:port1/history/done/2013/08/08/000000/job_1375923346354_0003_conf.xml
      Job Tracking URL : http://historyserver:port2/jobhistory/job/job_1375923346354_0003
      Uber job : false
      Number of maps: 80
      Number of reduces: 1
      map() completion: 0.0
      reduce() completion: 0.0
      Job state: FAILED
      retired: false
      reason for failure: There are no failed tasks for the job. Job is failed due to some other reason and reason can be found in the logs.
      Counters not available. Job is retired.
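
The mismatch above can be checked mechanically. The following is a minimal sketch, not part of the attached patches: it assumes the output of `hadoop job -status` has been captured as text and that the two fields appear in the format shown in this report. The function names `parse_job_status` and `states_disagree` are hypothetical helpers introduced only for illustration.

```python
import re

# Sample text copied from the output in this report (trimmed to the relevant lines).
SAMPLE = """\
13/08/08 01:24:13 INFO mapred.ClientServiceDelegate: Application state is completed. FinalApplicationStatus=SUCCEEDED. Redirecting to job history server

Job: job_1375923346354_0003
Job state: FAILED
retired: false
"""


def parse_job_status(output: str) -> dict:
    """Extract the RM-reported final status and the history-server job state.

    Assumption: the field formats match the output shown in this report.
    """
    final = re.search(r"FinalApplicationStatus=(\w+)", output)
    state = re.search(r"^\s*Job state:\s*(\w+)", output, re.MULTILINE)
    return {
        "final_application_status": final.group(1) if final else None,
        "job_state": state.group(1) if state else None,
    }


def states_disagree(output: str) -> bool:
    """Return True when both fields are present and report different results."""
    parsed = parse_job_status(output)
    final = parsed["final_application_status"]
    state = parsed["job_state"]
    if final is None or state is None:
        # Not enough information to compare; do not flag.
        return False
    return final != state


if __name__ == "__main__":
    print(parse_job_status(SAMPLE))
    print("mismatch:", states_disagree(SAMPLE))
```

Comparing the two strings directly is a simplification: it works for values such as SUCCEEDED, FAILED, and KILLED that both fields share, but a job that is still running would need separate handling.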

      Attachments

        1. MAPREDUCE-5466.patch
          14 kB
          Jian He
        2. MAPREDUCE-5466.1.patch
          23 kB
          Jian He
        3. MAPREDUCE-5466.2.patch
          22 kB
          Jian He
        4. MAPREDUCE-5466.3.patch
          22 kB
          Jian He

            People

              jianhe Jian He
              yeshavora Yesha Vora