Description
Started a cluster. Submitted a sleep job with around 10000 maps and 1000 reduces.
Killed AM with kill -9 by which time already 7000 thousands maps got completed.
On the RM webUI, Application is stuck in Application.RUNNING state. And JobClient goes into an infinite loop as RM keeps telling the client that the application is running.
Attachments
Attachments
Issue Links
- duplicates
-
MAPREDUCE-3072 NodeManager doesn't recognize kill -9 of AM container
- Resolved
-
MAPREDUCE-2875 NM does not communicate Container crash to RM
- Closed