
YARN-4344: NMs reconnecting with changed capabilities can lead to wrong cluster resource calculations

    Details

    • Hadoop Flags:
      Reviewed

      Description

      After YARN-3802, if an NM re-connects to the RM with changed capabilities, the overall cluster resource calculation can become incorrect, leading to inconsistencies in scheduling.

        Attachments

      1. YARN-4344.001.patch
        11 kB
        Varun Vasudev
      2. YARN-4344.002.patch
        12 kB
        Varun Vasudev
      3. YARN-4344-branch-2.6.001.patch
        12 kB
        Varun Vasudev

        Issue Links

          Activity

          vvasudev Varun Vasudev added a comment -

          An example of such a situation is shown below -

          2015-11-09 10:43:51,784 INFO  resourcemanager.ResourceTrackerService (ResourceTrackerService.java:registerNodeManager(345)) - NodeManager from node 10.0.0.64(cmPort: 30050 httpPort: 30060) registered with capability: <memory:5632, vCores:8>, assigned nodeId 10.0.0.64:30050
          2015-11-09 10:43:51,786 INFO  rmnode.RMNodeImpl (RMNodeImpl.java:handle(434)) - 10.0.0.64:30050 Node Transitioned from NEW to RUNNING
          2015-11-09 10:43:51,814 INFO  capacity.CapacityScheduler (CapacityScheduler.java:addNode(1193)) - Added node 10.0.0.64:30050 clusterResource: <memory:5632, vCores:8>
          2015-11-09 10:44:37,878 INFO  util.RackResolver (RackResolver.java:coreResolve(109)) - Resolved 10.0.0.63 to /default-rack
          2015-11-09 10:44:37,879 INFO  resourcemanager.ResourceTrackerService (ResourceTrackerService.java:registerNodeManager(345)) - NodeManager from node 10.0.0.63(cmPort: 30050 httpPort: 30060) registered with capability: <memory:10240, vCores:4>, assigned nodeId 10.0.0.63:30050
          2015-11-09 10:44:37,879 INFO  rmnode.RMNodeImpl (RMNodeImpl.java:handle(434)) - 10.0.0.63:30050 Node Transitioned from NEW to RUNNING
          2015-11-09 10:44:37,882 INFO  capacity.CapacityScheduler (CapacityScheduler.java:addNode(1193)) - Added node 10.0.0.63:30050 clusterResource: <memory:15872, vCores:12>
          2015-11-09 10:44:39,307 INFO  util.RackResolver (RackResolver.java:coreResolve(109)) - Resolved 10.0.0.64 to /default-rack
          2015-11-09 10:44:39,309 INFO  resourcemanager.ResourceTrackerService (ResourceTrackerService.java:registerNodeManager(313)) - Reconnect from the node at: 10.0.0.64
          2015-11-09 10:44:39,312 INFO  resourcemanager.ResourceTrackerService (ResourceTrackerService.java:registerNodeManager(345)) - NodeManager from node 10.0.0.64(cmPort: 30050 httpPort: 30060) registered with capability: <memory:10240, vCores:4>, assigned nodeId 10.0.0.64:30050
          2015-11-09 10:44:39,314 INFO  capacity.CapacityScheduler (CapacityScheduler.java:removeNode(1247)) - Removed node 10.0.0.64:30050 clusterResource: <memory:5632, vCores:8>
          2015-11-09 10:44:39,315 INFO  capacity.CapacityScheduler (CapacityScheduler.java:addNode(1193)) - Added node 10.0.0.64:30050 clusterResource: <memory:15872, vCores:12>
          

          In this case, NMs from 10.0.0.64 and 10.0.0.63 registered, leading to a total cluster resource of <memory:15872, vCores:12>. After that, 10.0.0.64 re-connected with changed capabilities (from <memory:5632, vCores:8> to <memory:10240, vCores:4>). This should have led to the cluster resources becoming <memory:20480, vCores:8>, but instead they are calculated to be <memory:15872, vCores:12>.
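
          For clarity, the arithmetic behind these totals (derived from the log above, assuming only these two nodes in the cluster):

          before reconnect:  <memory:5632, vCores:8>  + <memory:10240, vCores:4> = <memory:15872, vCores:12>
          expected after:    <memory:10240, vCores:4> + <memory:10240, vCores:4> = <memory:20480, vCores:8>
          observed after:    <memory:15872, vCores:12> - the removal subtracted the node's new capability
                             (<memory:10240, vCores:4>) instead of its old one (<memory:5632, vCores:8>),
                             and the re-add then added the new capability back, leaving the total unchanged.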

          The root cause is this piece of code from RMNodeImpl -

          rmNode.context.getDispatcher().getEventHandler().handle(
              new NodeRemovedSchedulerEvent(rmNode));

          if (!rmNode.getTotalCapability().equals(
              newNode.getTotalCapability())) {
            rmNode.totalCapability = newNode.getTotalCapability();
          

          If the dispatcher is delayed in processing the event, then by the time the node-removed event is handled, rmNode.totalCapability = newNode.getTotalCapability() has already executed, so the resources removed from the cluster total are the node's new capabilities rather than its old ones.
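
          To make the ordering concrete, here is a minimal, self-contained sketch of this race (stand-in classes only, not YARN code). The single-threaded executor takes the role of the RM's async dispatcher, and a latch simulates a dispatcher that is delayed in processing the removal event:

          import java.util.concurrent.CountDownLatch;
          import java.util.concurrent.ExecutorService;
          import java.util.concurrent.Executors;
          import java.util.concurrent.TimeUnit;

          // Stand-ins only, NOT the real YARN classes: nodeMemory plays the role of
          // rmNode.totalCapability and the single-threaded executor plays the role of
          // the RM's async dispatcher.
          public class ReconnectRaceSketch {
            public static void main(String[] args) throws Exception {
              ExecutorService dispatcher = Executors.newSingleThreadExecutor();
              CountDownLatch dispatcherDelayed = new CountDownLatch(1);

              final int[] nodeMemory = {5632};             // 10.0.0.64's old capability
              final int[] clusterMemory = {5632 + 10240};  // both nodes registered

              // ReconnectNodeTransition fires the "node removed" event first...
              dispatcher.submit(() -> {
                try { dispatcherDelayed.await(); } catch (InterruptedException ignored) { }
                // By the time the event is handled, the capability has already been
                // overwritten, so the NEW value is subtracted instead of the old one.
                clusterMemory[0] -= nodeMemory[0];
              });

              // ...then overwrites the capability without waiting for the scheduler
              // to process the removal (the problematic ordering).
              nodeMemory[0] = 10240;
              dispatcherDelayed.countDown();

              // The "node added" event correctly adds the new capability.
              dispatcher.submit(() -> { clusterMemory[0] += nodeMemory[0]; });

              dispatcher.shutdown();
              dispatcher.awaitTermination(1, TimeUnit.MINUTES);

              // Expected 20480 (10240 + 10240), but prints 15872, matching the log above.
              System.out.println("cluster memory = " + clusterMemory[0]);
            }
          }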

          vvasudev Varun Vasudev added a comment -

          Uploaded a patch with the fix. zhihai xu, Jason Lowe - can you please take a look?

          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 6s docker + precommit patch detected.
          +1 @author 0m 0s The patch does not contain any @author tags.
          +1 test4tests 0m 0s The patch appears to include 1 new or modified test files.
          +1 mvninstall 3m 6s trunk passed
          +1 compile 0m 21s trunk passed with JDK v1.8.0_60
          +1 compile 0m 23s trunk passed with JDK v1.7.0_79
          +1 checkstyle 0m 11s trunk passed
          +1 mvneclipse 0m 15s trunk passed
          +1 findbugs 1m 6s trunk passed
          +1 javadoc 0m 21s trunk passed with JDK v1.8.0_60
          +1 javadoc 0m 25s trunk passed with JDK v1.7.0_79
          +1 mvninstall 0m 27s the patch passed
          +1 compile 0m 21s the patch passed with JDK v1.8.0_60
          +1 javac 0m 21s the patch passed
          +1 compile 0m 23s the patch passed with JDK v1.7.0_79
          +1 javac 0m 23s the patch passed
          +1 checkstyle 0m 12s the patch passed
          +1 mvneclipse 0m 14s the patch passed
          +1 whitespace 0m 0s Patch has no whitespace issues.
          +1 findbugs 1m 17s the patch passed
          +1 javadoc 0m 21s the patch passed with JDK v1.8.0_60
          +1 javadoc 0m 25s the patch passed with JDK v1.7.0_79
          -1 unit 59m 49s hadoop-yarn-server-resourcemanager in the patch failed with JDK v1.8.0_60.
          -1 unit 61m 1s hadoop-yarn-server-resourcemanager in the patch failed with JDK v1.7.0_79.
          +1 asflicense 0m 21s Patch does not generate ASF License warnings.
          132m 2s



          Reason Tests
          JDK v1.8.0_60 Failed junit tests hadoop.yarn.server.resourcemanager.TestAMAuthorization
            hadoop.yarn.server.resourcemanager.scheduler.TestAbstractYarnScheduler
            hadoop.yarn.server.resourcemanager.TestClientRMTokens
          JDK v1.7.0_79 Failed junit tests hadoop.yarn.server.resourcemanager.TestAMAuthorization
            hadoop.yarn.server.resourcemanager.scheduler.TestAbstractYarnScheduler
            hadoop.yarn.server.resourcemanager.TestClientRMTokens



          Subsystem Report/Notes
          Docker Client=1.7.1 Server=1.7.1 Image:test-patch-base-hadoop-date2015-11-11
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12771722/YARN-4344.001.patch
          JIRA Issue YARN-4344
          Optional Tests asflicense javac javadoc mvninstall unit findbugs checkstyle compile
          uname Linux a9687b820f5f 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /home/jenkins/jenkins-slave/workspace/PreCommit-YARN-Build/patchprocess/apache-yetus-ee5baeb/precommit/personality/hadoop.sh
          git revision trunk / 23d0db5
          Default Java 1.7.0_79
          Multi-JDK versions /usr/lib/jvm/java-8-oracle:1.8.0_60 /usr/lib/jvm/java-7-openjdk-amd64:1.7.0_79
          findbugs v3.0.0
          unit https://builds.apache.org/job/PreCommit-YARN-Build/9655/artifact/patchprocess/patch-unit-hadoop-yarn-project_hadoop-yarn_hadoop-yarn-server_hadoop-yarn-server-resourcemanager-jdk1.8.0_60.txt
          unit https://builds.apache.org/job/PreCommit-YARN-Build/9655/artifact/patchprocess/patch-unit-hadoop-yarn-project_hadoop-yarn_hadoop-yarn-server_hadoop-yarn-server-resourcemanager-jdk1.7.0_79.txt
          unit test logs https://builds.apache.org/job/PreCommit-YARN-Build/9655/artifact/patchprocess/patch-unit-hadoop-yarn-project_hadoop-yarn_hadoop-yarn-server_hadoop-yarn-server-resourcemanager-jdk1.8.0_60.txt https://builds.apache.org/job/PreCommit-YARN-Build/9655/artifact/patchprocess/patch-unit-hadoop-yarn-project_hadoop-yarn_hadoop-yarn-server_hadoop-yarn-server-resourcemanager-jdk1.7.0_79.txt
          JDK v1.7.0_79 Test Results https://builds.apache.org/job/PreCommit-YARN-Build/9655/testReport/
          modules C: hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager U: hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager
          Max memory used 224MB
          Powered by Apache Yetus http://yetus.apache.org
          Console output https://builds.apache.org/job/PreCommit-YARN-Build/9655/console

          This message was automatically generated.

          jlowe Jason Lowe added a comment -

          Thanks for the patch, Varun! I think the change will fix the reported issue, but I'm a bit skeptical of the vastly different handling of the event based on whether apps are running or not. For example, if the http port is changing when the node re-registers, why are we treating it as a node removal then addition if there aren't any apps running but not if there are apps running? Seems like that should be consistent.

          Comments on the patch itself:

          The comment about sending the node removal event at the start of the main block in the transition is no longer very accurate.

          Please don't put large sleeps (on the order of seconds) in tests. These extra sleep seconds quickly add up to a significant amount of time over many tests. If we need to sleep for polling reasons the sleep should be much shorter, like on the order of 10ms. Better than sleep-polling is flushing the event dispatcher and then checking since we can avoid polling entirely.
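
          (For illustration only, not part of the patch: a rough sketch of the "flush the dispatcher, then assert" pattern, using a hypothetical single-threaded test dispatcher. The dispatch/drain names below are made up for this sketch; in the RM tests this role is played by draining the actual async dispatcher rather than by Thread.sleep() polling.)

          import java.util.concurrent.ExecutorService;
          import java.util.concurrent.Executors;
          import java.util.concurrent.TimeUnit;

          // Hypothetical stand-in for an async event dispatcher in a unit test.
          class TestDispatcher {
            private final ExecutorService executor = Executors.newSingleThreadExecutor();

            void dispatch(Runnable event) {
              executor.submit(event);
            }

            // "Flush": block until every previously dispatched event has been handled,
            // so the test can assert immediately afterwards instead of sleep-polling.
            void drain() throws Exception {
              executor.submit(() -> { }).get(10, TimeUnit.SECONDS);
            }
          }

          // Usage in a test (pseudocode for the assertion step):
          //   dispatcher.dispatch(nodeRemovedEvent);
          //   dispatcher.dispatch(nodeAddedEvent);
          //   dispatcher.drain();              // no Thread.sleep(...)
          //   assertEquals(expectedClusterResource, scheduler.getClusterResource());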

          Nit: isCapabilityChanged init can be simplified to the following, similar to the noRunningApps boolean init above it:

                boolean isCapabilityChanged =
                    !rmNode.getTotalCapability().equals(newNode.getTotalCapability());
           

          Nit: is this conditional check even necessary? We can just update the total capability with no semantic effect if it hasn't changed. Since this is just updating a reference with another precomputed one, it's not like we're avoiding some expensive code.

                  if (isCapabilityChanged) {
                    rmNode.totalCapability = newNode.getTotalCapability();
                  }
          
          zxu zhihai xu added a comment -

          Thanks for reporting this issue, Varun Vasudev! Thanks for the review, Jason Lowe!
          Rohith Sharma K S tried to clean up the code at YARN-3286. Based on the following comment from Jian He at YARN-3286,

          I think this has changed the behavior that without any RM/NM restart features enabled, earlier restarting a node will trigger RM to kill all the containers on this node, but now it won't ?
          

          The patch may cause a compatibility issue. Maybe we can merge the rmNode.getHttpPort() == newNode.getHttpPort() case with the rmNode.getHttpPort() != newNode.getHttpPort() case when there are no running apps.
          Thoughts?

          Naganarasimha Naganarasimha G R added a comment -

          Hi zhihai xu,
          Seems like the JIRA number is wrong, as YARN-3286 is closed as Won't Fix! Are you referring to another of Rohith Sharma K S's JIRAs?

          sunilg Sunil G added a comment -

          I think it's the correct JIRA id. A discussion happened there about removing a node and adding it back again when we get a ReconnectNodeTransition, but that change has an impact on the existing behavior of killing all containers when removing a node.

          Naganarasimha Naganarasimha G R added a comment -

          Thanks for the clarification; I had earlier interpreted zhihai xu's comment wrongly.

          jlowe Jason Lowe added a comment -

          Ah yes, the non-work-preserving NM restart case. The code is assuming that an NM registering without any active apps might be a non-work-preserving NM reconnecting, so we need to explicitly remove the node and add it back in so the scheduler will release any containers that were being tracked on that node.

          At first I thought YARN-3802 had an inherent race in it, in that it assumes the node event will be processed before the capability is updated. That turns out to be the case for the CapacityScheduler, but I think that's a bug in the CapacityScheduler. Note that the node update path appears to have the same issue – RMNodeImpl updates the node's capability before sending the scheduler the node-updated event. So how can it work in that case? It works because, for node update, the CapacityScheduler isn't looking at what the resource was in the RMNode passed in the event. Instead it looks up the scheduler node based on the RMNodeId and references the total capability tracked there. It seems to me the bug here is that the scheduler relies on the RMNode in the event directly, rather than the SchedulerNode, to handle the capability calculation. We probably should have limited a lot of these scheduler events to carrying just the RMNodeId rather than the full RMNode, to avoid the temptation to directly examine the RMNode when handling the event. As seen here, the RMNode can be "moving" while the scheduler is trying to examine it.
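
          (As an aside, here is a self-contained sketch of that principle, using illustrative stub classes rather than the actual patch or YARN code: the scheduler tracks its own snapshot of each node's capability and uses that snapshot for the cluster-resource bookkeeping.)

          import java.util.HashMap;
          import java.util.Map;

          // Illustrative stubs only, NOT YARN classes: the scheduler keeps its own
          // snapshot of each node's capability (the SchedulerNode role) and uses that
          // snapshot when adding/removing the node, instead of re-reading the shared,
          // mutable RMNode carried in the event.
          public class SchedulerSnapshotSketch {
            static class RMNodeStub { volatile int capability; }   // mutable, owned by the RMNode state machine
            static class SchedulerNodeStub {                       // scheduler's immutable snapshot
              final int totalResource;
              SchedulerNodeStub(int totalResource) { this.totalResource = totalResource; }
            }

            private final Map<String, SchedulerNodeStub> nodes = new HashMap<>();
            private int clusterMemory = 0;

            void addNode(String nodeId, RMNodeStub rmNode) {
              SchedulerNodeStub node = new SchedulerNodeStub(rmNode.capability);
              nodes.put(nodeId, node);
              clusterMemory += node.totalResource;   // snapshot taken at add time
            }

            void removeNode(String nodeId) {
              SchedulerNodeStub node = nodes.remove(nodeId);
              if (node == null) {
                return;
              }
              clusterMemory -= node.totalResource;   // old capability, even if the RMNode changed since
            }

            public static void main(String[] args) {
              SchedulerSnapshotSketch scheduler = new SchedulerSnapshotSketch();
              RMNodeStub node64 = new RMNodeStub(); node64.capability = 5632;
              RMNodeStub node63 = new RMNodeStub(); node63.capability = 10240;
              scheduler.addNode("10.0.0.64:30050", node64);
              scheduler.addNode("10.0.0.63:30050", node63);

              // Reconnect with changed capability: even though the RMNode is mutated
              // before the removal is processed, the removal uses the snapshot.
              node64.capability = 10240;
              scheduler.removeNode("10.0.0.64:30050");
              scheduler.addNode("10.0.0.64:30050", node64);

              System.out.println("cluster memory = " + scheduler.clusterMemory);  // 20480
            }
          }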

          zxu zhihai xu added a comment -

          +1 for Jason Lowe's suggestion to fix the issue on the scheduler side. Using SchedulerNode.getTotalResource() instead of RMNode.getTotalCapability() inside the scheduler can better decouple the scheduler from the RMNodeImpl state machine. It may also fix some other potential issues. For example, CapacityScheduler#addNode uses nodeManager.getTotalCapability() after creating the FiCaSchedulerNode; if nodeManager.totalCapability is changed by the RMNodeImpl state machine right after the FiCaSchedulerNode was created, a similar issue may happen.

          vvasudev Varun Vasudev added a comment -

          Thanks for the feedback, everyone. I've uploaded a new patch that uses the SchedulerNode in the capacity scheduler and the fifo scheduler (which has the same issue). I've also fixed the test to not use sleeps.

          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 7s docker + precommit patch detected.
          +1 @author 0m 0s The patch does not contain any @author tags.
          +1 test4tests 0m 0s The patch appears to include 1 new or modified test files.
          +1 mvninstall 3m 4s trunk passed
          +1 compile 0m 21s trunk passed with JDK v1.8.0_60
          +1 compile 0m 23s trunk passed with JDK v1.7.0_79
          +1 checkstyle 0m 12s trunk passed
          +1 mvnsite 0m 30s trunk passed
          +1 mvneclipse 0m 15s trunk passed
          +1 findbugs 1m 8s trunk passed
          +1 javadoc 0m 20s trunk passed with JDK v1.8.0_60
          +1 javadoc 0m 25s trunk passed with JDK v1.7.0_79
          +1 mvninstall 0m 27s the patch passed
          +1 compile 0m 21s the patch passed with JDK v1.8.0_60
          +1 javac 0m 21s the patch passed
          +1 compile 0m 24s the patch passed with JDK v1.7.0_79
          +1 javac 0m 24s the patch passed
          +1 checkstyle 0m 11s the patch passed
          +1 mvnsite 0m 30s the patch passed
          +1 mvneclipse 0m 15s the patch passed
          +1 whitespace 0m 0s Patch has no whitespace issues.
          +1 findbugs 1m 15s the patch passed
          +1 javadoc 0m 21s the patch passed with JDK v1.8.0_60
          +1 javadoc 0m 27s the patch passed with JDK v1.7.0_79
          -1 unit 65m 6s hadoop-yarn-server-resourcemanager in the patch failed with JDK v1.8.0_60.
          -1 unit 65m 27s hadoop-yarn-server-resourcemanager in the patch failed with JDK v1.7.0_79.
          +1 asflicense 0m 22s Patch does not generate ASF License warnings.
          142m 52s



          Reason Tests
          JDK v1.8.0_60 Failed junit tests hadoop.yarn.server.resourcemanager.TestClientRMTokens
            hadoop.yarn.server.resourcemanager.TestAMAuthorization
          JDK v1.7.0_79 Failed junit tests hadoop.yarn.server.resourcemanager.TestClientRMTokens
            hadoop.yarn.server.resourcemanager.TestAMAuthorization



          Subsystem Report/Notes
          Docker Client=1.7.1 Server=1.7.1 Image:test-patch-base-hadoop-date2015-11-13
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12772165/YARN-4344.002.patch
          JIRA Issue YARN-4344
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux 683bc3b1257c 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /home/jenkins/jenkins-slave/workspace/PreCommit-YARN-Build@2/patchprocess/apache-yetus-fa12328/precommit/personality/hadoop.sh
          git revision trunk / cccf884
          findbugs v3.0.0
          unit https://builds.apache.org/job/PreCommit-YARN-Build/9682/artifact/patchprocess/patch-unit-hadoop-yarn-project_hadoop-yarn_hadoop-yarn-server_hadoop-yarn-server-resourcemanager-jdk1.8.0_60.txt
          unit https://builds.apache.org/job/PreCommit-YARN-Build/9682/artifact/patchprocess/patch-unit-hadoop-yarn-project_hadoop-yarn_hadoop-yarn-server_hadoop-yarn-server-resourcemanager-jdk1.7.0_79.txt
          unit test logs https://builds.apache.org/job/PreCommit-YARN-Build/9682/artifact/patchprocess/patch-unit-hadoop-yarn-project_hadoop-yarn_hadoop-yarn-server_hadoop-yarn-server-resourcemanager-jdk1.8.0_60.txt https://builds.apache.org/job/PreCommit-YARN-Build/9682/artifact/patchprocess/patch-unit-hadoop-yarn-project_hadoop-yarn_hadoop-yarn-server_hadoop-yarn-server-resourcemanager-jdk1.7.0_79.txt
          JDK v1.7.0_79 Test Results https://builds.apache.org/job/PreCommit-YARN-Build/9682/testReport/
          modules C: hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager U: hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager
          Max memory used 229MB
          Powered by Apache Yetus http://yetus.apache.org
          Console output https://builds.apache.org/job/PreCommit-YARN-Build/9682/console

          This message was automatically generated.

          vvasudev Varun Vasudev added a comment -

          The test failures are unrelated to the patch.

          leftnoteasy Wangda Tan added a comment -

          Good catch, Varun Vasudev! The fix looks good to me; as zhihai xu commented, we should decouple the RMNode status from the scheduler's view.

          jlowe Jason Lowe added a comment -

          +1, lgtm. Varun Vasudev, could you also put up a patch for 2.6? It doesn't apply there.

          vvasudev Varun Vasudev added a comment -

          Uploaded a version for branch-2.6.

          jlowe Jason Lowe added a comment -

          +1 for branch-2.6 patch as well, committing this.

          hudson Hudson added a comment -

          FAILURE: Integrated in Hadoop-trunk-Commit #8864 (See https://builds.apache.org/job/Hadoop-trunk-Commit/8864/)
          YARN-4344. NMs reconnecting with changed capabilities can lead to wrong (jlowe: rev d36b6e045f317c94e97cb41a163aa974d161a404)

          • hadoop-yarn-project/CHANGES.txt
          • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/capacity/CapacityScheduler.java
          • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/capacity/TestCapacityScheduler.java
          • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/fifo/FifoScheduler.java
          jlowe Jason Lowe added a comment -

          Thanks to Varun for the contribution and to zhihai and Wangda for additional patch review! I committed this to trunk, branch-2, branch-2.7, and branch-2.6.

          hudson Hudson added a comment -

          FAILURE: Integrated in Hadoop-Yarn-trunk #1439 (See https://builds.apache.org/job/Hadoop-Yarn-trunk/1439/)
          YARN-4344. NMs reconnecting with changed capabilities can lead to wrong (jlowe: rev d36b6e045f317c94e97cb41a163aa974d161a404)

          • hadoop-yarn-project/CHANGES.txt
          • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/fifo/FifoScheduler.java
          • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/capacity/TestCapacityScheduler.java
          • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/capacity/CapacityScheduler.java
          hudson Hudson added a comment -

          FAILURE: Integrated in Hadoop-Mapreduce-trunk-Java8 #706 (See https://builds.apache.org/job/Hadoop-Mapreduce-trunk-Java8/706/)
          YARN-4344. NMs reconnecting with changed capabilities can lead to wrong (jlowe: rev d36b6e045f317c94e97cb41a163aa974d161a404)

          • hadoop-yarn-project/CHANGES.txt
          • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/capacity/TestCapacityScheduler.java
          • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/capacity/CapacityScheduler.java
          • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/fifo/FifoScheduler.java
          hudson Hudson added a comment -

          FAILURE: Integrated in Hadoop-Mapreduce-trunk #2647 (See https://builds.apache.org/job/Hadoop-Mapreduce-trunk/2647/)
          YARN-4344. NMs reconnecting with changed capabilities can lead to wrong (jlowe: rev d36b6e045f317c94e97cb41a163aa974d161a404)

          • hadoop-yarn-project/CHANGES.txt
          • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/capacity/CapacityScheduler.java
          • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/fifo/FifoScheduler.java
          • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/capacity/TestCapacityScheduler.java
          hudson Hudson added a comment -

          SUCCESS: Integrated in Hadoop-Yarn-trunk-Java8 #717 (See https://builds.apache.org/job/Hadoop-Yarn-trunk-Java8/717/)
          YARN-4344. NMs reconnecting with changed capabilities can lead to wrong (jlowe: rev d36b6e045f317c94e97cb41a163aa974d161a404)

          • hadoop-yarn-project/CHANGES.txt
          • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/fifo/FifoScheduler.java
          • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/capacity/TestCapacityScheduler.java
          • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/capacity/CapacityScheduler.java
          hudson Hudson added a comment -

          FAILURE: Integrated in Hadoop-Hdfs-trunk-Java8 #633 (See https://builds.apache.org/job/Hadoop-Hdfs-trunk-Java8/633/)
          YARN-4344. NMs reconnecting with changed capabilities can lead to wrong (jlowe: rev d36b6e045f317c94e97cb41a163aa974d161a404)

          • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/capacity/TestCapacityScheduler.java
          • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/capacity/CapacityScheduler.java
          • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/fifo/FifoScheduler.java
          • hadoop-yarn-project/CHANGES.txt
          hudson Hudson added a comment -

          FAILURE: Integrated in Hadoop-Hdfs-trunk #2572 (See https://builds.apache.org/job/Hadoop-Hdfs-trunk/2572/)
          YARN-4344. NMs reconnecting with changed capabilities can lead to wrong (jlowe: rev d36b6e045f317c94e97cb41a163aa974d161a404)

          • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/capacity/TestCapacityScheduler.java
          • hadoop-yarn-project/CHANGES.txt
          • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/fifo/FifoScheduler.java
          • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/capacity/CapacityScheduler.java
          vinodkv Vinod Kumar Vavilapalli added a comment -

          Pulled this into 2.7.2 to keep the release up-to-date with 2.6.3. Changing fix-versions to reflect the same.


            People

            • Assignee: Varun Vasudev
            • Reporter: Varun Vasudev
            • Votes: 0
            • Watchers: 17
