Hadoop Map/Reduce
  1. Hadoop Map/Reduce
  2. MAPREDUCE-4152

map task left hanging after AM dies trying to connect to RM

    Details

    • Type: Bug Bug
    • Status: Closed
    • Priority: Major Major
    • Resolution: Fixed
    • Affects Version/s: 0.23.2
    • Fix Version/s: 0.23.3, 2.0.2-alpha
    • Component/s: mrv2
    • Labels:
      None

      Description

      We had an instance where the RM went down for more then an hour. The application master exited with "Could not contact RM after 360000 milliseconds"

      2012-04-11 10:43:36,040 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.JobImpl: job_1333003059741_15999Job Transitioned from RUNNING to ERROR

      1. MAPREDUCE-4152.patch
        8 kB
        Thomas Graves
      2. MAPREDUCE-4152.patch
        19 kB
        Thomas Graves
      3. MAPREDUCE-4152.patch
        19 kB
        Thomas Graves
      4. MAPREDUCE-4152.patch
        19 kB
        Thomas Graves

        Activity

        No work has yet been logged on this issue.

          People

          • Assignee:
            Thomas Graves
            Reporter:
            Thomas Graves
          • Votes:
            0 Vote for this issue
            Watchers:
            6 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development