Uploaded image for project: 'Hadoop Map/Reduce'
  1. Hadoop Map/Reduce
  2. MAPREDUCE-7130

Rumen crashes trying to handle MRAppMaster recovery events

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • None
    • 3.3.0
    • tools/rumen
    • None
    • Reviewed

    Description

      In the event of an MRAppMaster recovery, the Job History file gets an event of the following form:

      {"type":"JOB_KILLED","event":\{"org.apache.hadoop.mapreduce.jobhistory.JobUnsuccessfulCompletion":{"jobid":"job_1532048817013_xxxx","finishTime":1534521962641,"finishedMaps":0,"finishedReduces":0,"jobStatus":"SUCCEEDED","diagnostics":{"string":"Job commit succeeded in a prior MRAppMaster attempt before it crashed. Recovering."},"failedMaps":0,"failedReduces":0,"killedMaps":0,"killedReduces":0}}}
      

      The issue seems to be around the SUCCEEDED job status for a JobUnsuccessfulCompletion:
      https://github.com/apache/hadoop/blob/e0f6ffdbad6f43fd43ec57fb68ebf5275b8b9ba0/hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/JobBuilder.java#L609

      Which fails to find the enum here:
      https://github.com/apache/hadoop/blob/e0f6ffdbad6f43fd43ec57fb68ebf5275b8b9ba0/hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/Pre21JobHistoryConstants.java#L50

      I'm not sure if this is an error with the Rumen parser or if the job history file is getting into an invalid state.

      Attachments

        1. MAPREDUCE-7130-001.patch
          1 kB
          Peter Bacsko
        2. MAPREDUCE-7130-002.patch
          1 kB
          Peter Bacsko

        Activity

          People

            pbacsko Peter Bacsko
            jonathan.bender Jonathan Bender
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: