Details
-
Bug
-
Status: Resolved
-
Critical
-
Resolution: Fixed
-
2.8.0
-
None
-
Reviewed
Description
AM blacklist addition or removal is updated only when RMAppAttempt is scheduled i.e RMAppAttemptImpl#ScheduleTransition#transition. But once attempt is scheduled if there is any removeNode/addNode in cluster then this is not updated to BlackListManager#refreshNodeHostCount. This leads BlackListManager to operate on stale NM's count. And application is in ACCEPTED state and wait forever even if blacklisted nodes are reconnected with clearing disk space.