[MAPREDUCE-4152] map task left hanging after AM dies trying to connect to RM - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: 0.23.2
Fix Version/s: 0.23.3, 2.0.2-alpha
Component/s: mrv2
Labels:
None

Target Version/s:

0.23.3

Description

We had an instance where the RM went down for more then an hour. The application master exited with "Could not contact RM after 360000 milliseconds"

2012-04-11 10:43:36,040 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.JobImpl: job_1333003059741_15999Job Transitioned from RUNNING to ERROR

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

MAPREDUCE-4152.patch
23/May/12 20:27
19 kB
Thomas Graves
MAPREDUCE-4152.patch
23/May/12 20:07
19 kB
Thomas Graves
MAPREDUCE-4152.patch
01/May/12 17:20
19 kB
Thomas Graves
MAPREDUCE-4152.patch
27/Apr/12 19:23
8 kB
Thomas Graves

Activity

People

Assignee:: Thomas Graves

Reporter:: Thomas Graves

Votes:: 0 Vote for this issue

Watchers:: 5 Start watching this issue

Dates

Created:: 13/Apr/12 15:27

Updated:: 11/Oct/12 17:48

Resolved:: 30/May/12 14:55