Hadoop YARN / YARN-1197

Support changing resources of an allocated container

    Details

    • Type: Task
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 2.1.0-beta
    • Fix Version/s: None
    • Labels: None

      Description

      The current YARN resource management logic assumes that the resources allocated to a container are fixed for its lifetime. When users want to change the
      resources of an allocated container, the only way is to release it and allocate a new container of the expected size.
      Allowing run-time changes to the resources of an allocated container gives applications better control over their resource usage.
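
      For illustration only, a minimal sketch of how an ApplicationMaster might use such a capability; the requestContainerResourceChange() call is a placeholder for whatever interface this JIRA eventually settles on, not an existing AMRMClient method.

      import org.apache.hadoop.yarn.api.records.Container;
      import org.apache.hadoop.yarn.api.records.Resource;
      import org.apache.hadoop.yarn.client.api.AMRMClient;

      // Hypothetical usage sketch: grow a running container to 4 GB / 2 vcores
      // instead of releasing it and requesting a brand-new container of that size.
      public class ContainerResizeSketch {
        public static void resize(AMRMClient<AMRMClient.ContainerRequest> rmClient,
                                  Container running) {
          Resource target = Resource.newInstance(4096, 2); // 4096 MB, 2 vcores

          // Placeholder call, not part of AMRMClient at the time of this JIRA:
          // rmClient.requestContainerResourceChange(running, target);
          // The approved change (an updated Container/token) would come back on a
          // later allocate() response, as discussed in the comments below.
        }
      }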

      1. YARN-1197 old-design-docs-patches-for-reference.zip
        813 kB
        Wangda Tan
      2. YARN-1197_Design.pdf
        692 kB
        MENG DING
      3. YARN-1197_Design.2015.07.07.pdf
        700 kB
        MENG DING
      4. YARN-1197_Design.2015.06.24.pdf
        685 kB
        MENG DING

        Issue Links

          Activity

          MENG DING made changes -
          Attachment YARN-1197_Design.2015.07.07.pdf [ 12744085 ]
          MENG DING added a comment -

          The design doc has been updated to include a Design Choices section, which summarizes the rationales behind major design decisions.

          Patches to NodeManager side of the implementation have been posted for review, which include:
          YARN-1449: AM-NM protocol changes to support container resizing
          YARN-1645: ContainerManager implementation to support container resizing
          YARN-3867: ContainerImpl changes to support container resizing
          YARN-1643: ContainersMonitor changes to support monitoring/enforcing container resizing
          YARN-1644: RM-NM protocol changes and NodeStatusUpdater implementation to support container resizing
          YARN-3868: ContainerManager recovery for container resizing

          In addition, YARN-3866 has been posted for review to unblock RM/Scheduler changes in YARN-1651 and YARN-1646.

          MENG DING added a comment -

          Released containers which are in the RUNNING state are put in NodeHeartbeatResponse.containersToCleanup and sent to the NM through the heartbeat response. After the NM receives the list, it forcefully kills these containers. I don't see any logic in the code right now to acknowledge released containers from the NM back to the RM, though. In reality, I guess most containers being released by the AM will be in the ACQUIRED state, not the RUNNING state.

          MENG DING added a comment -

          Bikas Saha, I will update the design doc with detailed intuition/rationale behind all the design choices based on the discussion in this thread.

          Bikas Saha added a comment -

          There has been a lot of discussion that looks like it's converging. It would be helpful for the other interested (but not deeply involved) people if there were an updated design document with details about the agreed-upon design. Also, if this document could outline some of the intuition/logic behind the design choices (like going through the AM for low latency), that would be super useful. Thanks!

          Wangda Tan added a comment -

          I think we can handle container decreasing similar to existing AM releasing container for now: NM network failure while decreasing is more like a corner case to me, we can add response if it is necessary. And we also need to see if AM releasing container needs similar acknowledgement from NM.

          Lei Guo added a comment -

          Agreed, this is similar to the other cases you mentioned. In this case, we may need to recommend that the AM implementation check/confirm the decrease status after the request.

          MENG DING added a comment -

          One option to consider is to let the NM confirm back to the RM when it is done decreasing the container size. If the RM doesn't receive confirmation from the NM, it will keep sending the decrease message to the NM during heartbeat. This is only for the purpose of resource enforcement. From a scheduling point of view, as soon as the decrease request is approved in the RM, it takes effect immediately.

          I am not sure if this is worth the effort.
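
          For what it's worth, a minimal sketch of the resend-until-acknowledged bookkeeping described above; the class and method names here are purely illustrative and do not correspond to actual RM code.

          import java.util.Map;
          import java.util.Set;
          import java.util.concurrent.ConcurrentHashMap;

          import org.apache.hadoop.yarn.api.records.ContainerId;
          import org.apache.hadoop.yarn.api.records.Resource;

          // Hypothetical RM-side bookkeeping for "keep sending the decrease until the NM confirms".
          class PendingDecreaseTracker {
            private final Map<ContainerId, Resource> pendingDecreases = new ConcurrentHashMap<>();

            // Scheduler approved a decrease: it takes effect for scheduling right away,
            // but is remembered until the NM acknowledges enforcement.
            void onDecreaseApproved(ContainerId id, Resource target) {
              pendingDecreases.put(id, target);
            }

            // On every node heartbeat, piggyback all still-unacknowledged decreases.
            Map<ContainerId, Resource> decreasesForHeartbeat() {
              return pendingDecreases;
            }

            // The NM reported which containers it has already shrunk; stop resending those.
            void onNodeAck(Set<ContainerId> acknowledged) {
              pendingDecreases.keySet().removeAll(acknowledged);
            }
          }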

          MENG DING added a comment -

          Lei Guo, the NM will persist the resource decrease in LevelDB when it receives the decrease message, so if it fails and is restarted, it can recover the correct container size. In the case of a network failure, the decrease message will be lost, but this is the same as with all other messages in the response (e.g., containers to clean up/remove). In practice, I don't think this is a serious problem, as we assume that by the time a user issues the resource decrease request for a container, that container should have already given up that amount of resource.

          Let me know if you have any thoughts or ideas.
          Thanks.

          Lei Guo added a comment -

          MENG DING, for the decrease flow via NodeHeartbeatResponseProto to notify the NM, how do we handle the case of a network/NM failure?

          MENG DING made changes -
          Attachment YARN-1197_Design.2015.06.24.pdf [ 12741738 ]
          MENG DING added a comment -

          The design doc has been updated based on the latest discussion and feedback from all parties. I have added a section for NodeManager recovery as well. Some details still need to be ironed out, which will be covered in sub-tasks. Special thanks to Wangda Tan for all the valuable suggestions offline.

          Sandy Ryza added a comment -

          The latest proposal makes sense to me as well. Thanks Tan, Wangda and MENG DING!

          Wangda Tan added a comment -

          MENG DING,
          The latest proposal (https://issues.apache.org/jira/browse/YARN-1197?focusedCommentId=14588963&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-14588963) makes sense to me.

          We definitely need AllocateResponseProto for container increase token. For decrease result, it is optional, but probably it doesn't hurt to set it anyway.

          I suggest we only include the token when it's necessary; we can add a token to the decrease result when needed.

          We could just use the existing getContainerStatus() API for doing this polling for now.

          +1, we don't need a new API.

          Sandy Ryza, do you agree with the latest proposal?

          MENG DING added a comment -

          Sorry got things messed up.

          Correction:

          We definitely need AllocateResponseProto for container increase token. For decrease result, it is optional, but probably it doesn't hurt to set it anyway.

          Vinod Kumar Vavilapalli added a comment -

          I don't think it's possible for the AM to start using the additional allocation till the NM has updated all its state - including writing out recovery information for work-preserving restart (Thanks Vinod for pointing this out). Seems like that poll/callback will be required - unless the plan is to route this information via the RM.

          We could just use the existing getContainerStatus() API for doing this polling for now.

          MENG DING added a comment -

          Wangda Tan, I am certainly OK doing (a). My original frustration was mainly about inconsistency in the RM when doing decrease through the NM; now that we have all agreed that decrease should go through the RM, the problem is gone.

          So here is the latest proposal:

          • Container resource decrease:
            AM -> RM -> NM
          • Container resource increase:
            AM -> RM -> AM(token) -> NM. AM needs to poll status of container before using the additional allocation.
            Of course we need to properly handle token expiration (i.e., NM -> RM communication is needed to unregister the container from the expirer).

          In addition, I do not see a need for any response to be set in the AllocateResponseProto:

          • For resource decrease, we can assume it is always successful.
          • For resource increase, we are now doing polling to see if the increase is successful.

          Let me know if this makes sense.
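
          To make the increase path concrete, here is a rough AM-side sketch; the increaseContainerResource() call and the capability check on the polled status are assumptions about the eventual API, not existing methods.

          import org.apache.hadoop.yarn.api.records.Container;
          import org.apache.hadoop.yarn.api.records.ContainerStatus;
          import org.apache.hadoop.yarn.api.records.Resource;
          import org.apache.hadoop.yarn.client.api.NMClient;

          // Hypothetical AM-side flow for the proposed increase path:
          // AM -> RM (request) -> AM (updated token) -> NM (apply), then AM polls before use.
          class IncreaseFlowSketch {
            static void applyIncrease(NMClient nmClient, Container increased, Resource target)
                throws Exception {
              // The allocate() response is assumed to have carried back 'increased', a
              // Container whose token now authorizes the larger size.

              // Hand the token to the NM; placeholder call, not an existing NMClient method:
              // nmClient.increaseContainerResource(increased);

              // Poll the existing getContainerStatus() API until the NM reflects the new size,
              // before the AM schedules work that relies on the extra resources.
              for (int attempt = 0; attempt < 50; attempt++) {
                ContainerStatus status =
                    nmClient.getContainerStatus(increased.getId(), increased.getNodeId());
                // Assumed: the status exposes the enforced capability once the NM is done.
                // if (target.equals(status.getCapability())) { return; }
                Thread.sleep(200);
              }
            }
          }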

          Wangda Tan added a comment -

          Thanks for comment, Siddharth Seth/Sandy Ryza.

          Now I'm convinced, from the two downstream developers' view: +1 to do the AM-RM-AM-NM flow (a) for increase, as in the original doc, before (b). I'm not sure if (b) is really required; we can do (b) if there are any real use cases.

          More broadly, just because YARN is not good at hitting sub-second latencies doesn't mean that it isn't a design goal. I strongly oppose any argument that uses the current slowness of YARN as a justification for why we should make architectural decisions that could compromise latencies.

          Make sense to me.

          I.e. that an AM can receive an increase from the RM, then issue a decrease to the NM, and then use its increase to get resources it doesn't deserve?

          Yes, if we send the increase request to the RM but send the decrease request to the NM, we need to handle complex inconsistencies on the RM side. You can take a look at the latest design doc for more details.

          I don't think it's possible for the AM to start using the additional allocation till the NM has updated all its state - including writing out recovery information for work-preserving restart (Thanks Vinod for pointing this out). Seems like that poll/callback will be required - unless the plan is to route this information via the RM.

          Maybe we need to wait for all increase steps (monitor/cgroup/state-store) to finish before using the additional allocation. If a container is 5G and is increased to 10G, but the RM/NM crashes before writing to the state store while the app starts using 10G, then after RM restart/recovery the NM/RM will think the container is 5G, which will be problematic.

          MENG DING, do you agree with doing (a)?

          Siddharth Seth added a comment -

          I would argue that waiting for an NM-RM heartbeat is much worse than waiting for an AM-RM heartbeat. With continuous scheduling, the RM can make decisions in millisecond time, and the AM can regulate its heartbeats according to the application's needs to get fast responses. If an NM-RM heartbeat is involved, the application is at the mercy of the cluster settings, which should be in the multi-second range for large clusters.

          I tend to agree with Sandy's arguments about option a being better in terms of latency - and that we shouldn't be architecting this in a manner which would limit it to the seconds range rather than milliseconds / hundreds of milliseconds when possible.

          It's already possible to get fast allocations - low 100s of milliseconds via a scheduler loop which is delinked from NM heartbeats and a variable AM-RM heartbeat interval, which is under user control rather than being a cluster property.

          There are going to be improvements to the performance of various protocols in YARN. HADOOP-11552 opens up one such option, which allows AMs to know about allocations as soon as the scheduler has made the decision, without a requirement to poll. Of course, there's plenty of work to be done before that can actually be used.

          That said, callbacks on the RPC can be applied at various levels - including NM-RM communication, which can make option b work fast as well. However, it will incur the cost of additional RPC roundtrips. Option a, however, can be fast from the get go with tuning, and also gets better with future enhancements.

          I don't think it's possible for the AM to start using the additional allocation till the NM has updated all its state - including writing out recovery information for work-preserving restart (Thanks Vinod for pointing this out). Seems like that poll/callback will be required - unless the plan is to route this information via the RM.

          MENG DING added a comment -

          Wangda Tan, if I understand it correctly, in the AllocateResponseProto we will have something like containers_change_approved and containers_change_completed. The former will be filled with the ID/capability of containers whose change requests have been approved by the RM. The latter will be filled with the ID/capability of containers whose resource change action has been completed in the NM. Right?

          MENG DING added a comment -

          Sandy Ryza, by processing both resource decrease and increase requests through the RM, the original problem that I brought up should not be an issue any more.
          What we are trying to grasp right now is whether it is really necessary for the increase action to go through RM->AM->NM. IMHO, if we can eliminate the need for that while still achieving reasonable performance, that would be ideal.

          Wangda Tan added a comment -

          Thanks MENG DING,

          I think (c) sounds like a very good proposal; it has these advantages:

          • Latency is better than (a) (if we assume network conditions between AM-RM and RM-NM are the same, since the RM sends the response to the NM at the same heartbeat).
          • It doesn't expose the container token, etc. to the AM when the increase is approved, which is not necessary; the AM only needs to poll the NM about the status of the resource change.
          • It can be considered an additional step on top of (b) ((c) = (b) + rm_response_to_am_when_increase_approved + am_poll_nm_about_increase_status). Good for planning as well.

          We can have option (b) enabled by default, and use a configuration parameter to turn on option (c) for frameworks like Spark.

          I think the two can be enabled together; I don't see any conflict between them. The AM can poll the NM if it doesn't want to wait another NM-RM heartbeat.

          Thoughts? Sandy Ryza, Vinod Kumar Vavilapalli.

          Sandy Ryza added a comment -

          I think this assumes the cluster is quite idle; I understand the low latency could be achieved, but it's not guaranteed since we don't support oversubscribing, etc.

          If the cluster is fully contended we certainly won't get this performance. But as long as there is a decent chunk of space, which is common in many settings, we can. The cluster doesn't need to be fully idle by any means.

          More broadly, just because YARN is not good at hitting sub-second latencies doesn't mean that it isn't a design goal. I strongly oppose any argument that uses the current slowness of YARN as a justification for why we should make architectural decisions that could compromise latencies.

          That said, I still don't have a strong grasp on the kind of complexity we're introducing in the AM, so would like to try to understand that before arguing against you further.

          Is the main problem we're grappling still the one Meng brought up here:
          https://issues.apache.org/jira/browse/YARN-1197?focusedCommentId=14556803&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-14556803?
          I.e. that an AM can receive an increase from the RM, then issue a decrease to the NM, and then use its increase to get resources it doesn't deserve?

          Or is the idea that, even if we didn't have this JIRA, NMClient is too complicated, and we'd like to reduce that?

          MENG DING added a comment -

          Thanks guys for all the comments! I think we all agreed that the container decrease request should go through the RM, and the decrease action will be triggered with the RM->NM heartbeat.

          For the increase request and action, theoretically option (a) will have better performance, but we are incurring extra complexity for both YARN and application writers. I was wondering if we can consider option (c), which sort of meets (a) and (b) in the middle:

          1) AM sends increase request to RM
          2) RM allocates the resource and sends the increase token to NM.
          3) RM sends response to AM right away, instead of waiting for NM to confirm that the increase action has been completed.
          4) Upon receiving the response (which indicates that the increase has been triggered), AM should first poll the container status to make sure that the increase is done before taking action to allocate new tasks.

          Option (c) will save one NM-RM heartbeat cycle, and since both option (a) and (c) need to poll container status, their performance will be very close.

          We can have option (b) enabled by default, and use a configuration parameter to turn on option (c) for frameworks like Spark.

          Do you think this is worth considering?
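
          As a rough illustration of what step 2) implies on the NM side (the option (b)/(c) path), here is a hypothetical sketch; the heartbeat-response handling, state-store call, and monitor hook are all made-up names, not NodeManager code.

          import java.util.List;

          import org.apache.hadoop.yarn.api.records.ContainerId;
          import org.apache.hadoop.yarn.api.records.Resource;

          // Hypothetical NM-side handling for options (b)/(c): approved increases arrive on
          // the RM-NM heartbeat response and the NM enforces them locally.
          class NodeIncreaseSketch {
            // Stand-ins for the NM state store and the containers monitor.
            interface StateStore { void storeContainerResource(ContainerId id, Resource r); }
            interface Monitor { void setResourceLimit(ContainerId id, Resource r); }

            static final class ContainerToIncrease {
              final ContainerId id;
              final Resource target;
              ContainerToIncrease(ContainerId id, Resource target) { this.id = id; this.target = target; }
            }

            private final StateStore stateStore;
            private final Monitor monitor;

            NodeIncreaseSketch(StateStore stateStore, Monitor monitor) {
              this.stateStore = stateStore;
              this.monitor = monitor;
            }

            // Called when a heartbeat response carries containers to increase.
            void onHeartbeatResponse(List<ContainerToIncrease> toIncrease) {
              for (ContainerToIncrease c : toIncrease) {
                // Persist first so a restarted NM recovers the new size (work-preserving restart).
                stateStore.storeContainerResource(c.id, c.target);
                // Then update monitoring/cgroups enforcement to the new limit.
                monitor.setResourceLimit(c.id, c.target);
                // On the next heartbeat the NM reports these back, so the RM (and, in option (c),
                // a polling AM) can see that the increase is done.
              }
            }
          }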

          Wangda Tan added a comment -

          To clarify: with proper tuning, we can currently get low hundreds of milliseconds without adding any new scheduler features. With the new scheduler feature I'm imagining, we'd only be limited by the RPC + scheduler time, so we could get 10s of milliseconds with proper tuning.

          I think this assumes the cluster is quite idle; I understand the low latency could be achieved, but it's not guaranteed since we don't support oversubscribing, etc. If you assume the cluster is very idle, one solution might be holding more resources at the beginning instead of increasing. In a real environment, I think the expected delay is still at the seconds level.

          From YARN's perspective, (b) handles most of the logic within the YARN daemons (instead of the AM), and we don't need to consider inconsistent status between RM/AM when doing recovery; that is really what I prefer. I'm not against doing (a), but I prefer to do that when we have a solid foundation for fast scheduling. I'm not sure if any resource management platform in production supports that; some research papers such as Sparrow use a quite different protocol/approach than YARN. I expect there are still some TODO items for YARN to get guaranteed fast scheduling.

          Sandy Ryza added a comment -

          Regarding complexity in the AM, the NMClient utility so far has been an API that's fairly easy for app developers to interact with. I've used it more than once and had no issues. Would we not be able to handle most of the additional complexity behind it?

          Sandy Ryza added a comment -

          If you consider all current/future optimizations, such as continuous scheduling / the scheduler making a decision at the same AM-RM heartbeat, (b) needs one more NM-RM heartbeat interval. I agree with you, it could be hundreds of milliseconds for (a) vs. multiple seconds for (b) when the cluster is idle.

          To clarify: with proper tuning, we can currently get low hundreds of milliseconds without adding any new scheduler features. With the new scheduler feature I'm imagining, we'd only be limited by the RPC + scheduler time, so we could get 10s of milliseconds with proper tuning.

          Wangda Tan added a comment -

          Sandy Ryza,
          Thanks for replying,

          Why does the AM need to poll the NM about increase status before taking action? Does the NM need to do anything other than update its tracking of the resources allotted to the container?

          Yes, the NM only needs to update its tracking of the resource and cgroups. We cannot assume this happens immediately, so we cannot put "container increased" into the same RPC. This is the same as startContainer: even if launching a container is fast in most cases, the AM needs to poll the NM after invoking startContainer.

          Would option (b) ever be able to achieve this kind of latency?

          If you consider all current/future optimizations, such as continuous scheduling / the scheduler making a decision at the same AM-RM heartbeat, (b) needs one more NM-RM heartbeat interval. I agree with you, it could be hundreds of milliseconds for (a) vs. multiple seconds for (b) when the cluster is idle.

          But I'm wondering, do we really need to add this complexity to the AM before we have the mature optimizations listed above? Also, if the cluster is busier, we cannot expect that low delay anyway. I tend toward doing (b) now since it's simpler for app developers to use this feature; I'm open to adding the AM->NM channel once the YARN scheduler supports fast scheduling better.

          Sandy Ryza added a comment -

          Option (a) can occur in the low hundreds of milliseconds if the cluster is tuned properly, independent of cluster size.
          1) Submit increase request to RM. Poll RM 100 milliseconds later after continuous scheduling thread has run in order to pick up the increase token.
          2) Send increase token to NM.

          Why does the AM need to poll the NM about increase status before taking action? Does the NM need to do anything other than update its tracking of the resources allotted to the container?

          Also, it's not unlikely that schedulers will be improved to return the increase token on the same heartbeat that it's requested. So this could all happen in 2 RPCs + a scheduler decision, and no additional wait time. Anything more than this is probably prohibitively expensive for a framework like Spark to submit an increase request before running each task.

          Would option (b) ever be able to achieve this kind of latency?

          Wangda Tan added a comment -

          Sandy Ryza,
          I think increasing via AM<->NM and via RM<->NM are in a very similar range of delay (multiple seconds for now).

          a. AM<->NM needs 3 stages
          1) AM Get increase token from RM
          2) AM send increase token to NM
          3) Polling NM about increase status (because we cannot assume the increase can be done on the NM side very fast)

          b. RM->NM needs 4 stages
          1) RM send back increasing token to NM
          2) NM doing increase locally
          3) NM report back to RM when increasing done
          4) RM send increase done to AM

          Solution b. has an additional RM->NM heartbeat interval

          Benefits of b. (Some of them also mentioned by Meng)

          • Simpler for the AM: it only needs to know when the increase is done, and doesn't need to receive a token and submit/poll the NM.
          • Creates a consistent way for applications to increase/decrease containers.
          • Recovery is simpler: the AM only knows about the increase when it's finished, so we only need to handle recovery of 2 components (NM/RM) instead of 3 (NM/RM/AM).

          Before we have a fast scheduling design/plan (I don't think we can support millisecond-level scheduling for now; too-frequent AM heartbeating will overload the RM), I don't think adding an additional NM->RM heartbeat interval is a big problem.

          Sandy Ryza added a comment -

          RM still needs to wait for an acknowledgement from NM to confirm that the increase is done before sending out the response to AM. This will take two heartbeat cycles, but this is not much worse than giving out a token to AM first, and then letting the AM initiate the increase.

          I would argue that waiting for an NM-RM heartbeat is much worse than waiting for an AM-RM heartbeat. With continuous scheduling, the RM can make decisions in millisecond time, and the AM can regulate its heartbeats according to the application's needs to get fast responses. If an NM-RM heartbeat is involved, the application is at the mercy of the cluster settings, which should be in the multi-second range for large clusters.

          Sandy Ryza added a comment -

          Is my understanding correct that the broader plan is to move stopping containers out of the AM-NM protocol?

          Vinod Kumar Vavilapalli added a comment -

          The details look good.

          Let's make sure we handle RM, AM and NM restarts correctly. Also, let's design the RM - NM protocol to be generic and common enough for regular launch/stop and increase/decrease.

          Tx again for driving this!

          MENG DING added a comment -

          Sandy Ryza, Yes. The key assumption is that by the time the Application Master requests resource decrease from RM for a particular container, that container should have already reduced its resource usage. Therefore, RM can immediately allocate resource to others.

          So to summarize the main idea:

          • Both container resource increase and decrease requests go through RM. This eliminates the race condition where, while a container increase is in progress, a decrease for the same container takes place.
          • There is no need for the AM-NM protocol anymore. This greatly simplifies the logic for application writers.
          • Resource decrease can happen immediately in RM, and the actual enforcement/monitoring of the decrease can happen offline, as mentioned by Vinod.
          • Resource increase, on the other hand, needs more thought.
            • In the current design, the RM gives out an increase token to be used by AM to initiate the increase on NM. There is no need for this. RM can notify the increase to NM through RM-NM heartbeat response.
            • RM still needs to wait for an acknowledgement from NM to confirm that the increase is done before sending out the response to AM. This will take two heartbeat cycles, but this is not much worse than giving out a token to AM first, and then letting the AM initiate the increase.
            • Since RM needs to wait for acknowledgement from NM to confirm the increase, we must handle such cases as timeout, NM restart/recovery, etc. So we probably still need to have a container increase token, and token expiration logic for this purpose, but the token will be sent to NM through RM-NM heartbeat protocol. (I am still working out the details)
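
          As a rough sketch of the expiration idea in the last bullet, the snippet below tracks granted-but-unconfirmed increases on the RM side; every name is hypothetical, and a real implementation would hang off the existing container-token expirer rather than a standalone class.

          import java.util.Map;
          import java.util.concurrent.ConcurrentHashMap;

          import org.apache.hadoop.yarn.api.records.ContainerId;
          import org.apache.hadoop.yarn.api.records.Resource;

          // Hypothetical RM-side tracking of granted-but-unconfirmed increases: if the NM never
          // confirms within the expiry interval, the extra allocation would be rolled back.
          class PendingIncreaseExpirer {
            private static final long EXPIRY_MS = 10 * 60 * 1000L;

            private static final class Pending {
              final Resource grantedDelta;
              final long grantedAtMs;
              Pending(Resource grantedDelta, long grantedAtMs) {
                this.grantedDelta = grantedDelta;
                this.grantedAtMs = grantedAtMs;
              }
            }

            private final Map<ContainerId, Pending> pending = new ConcurrentHashMap<>();

            // Scheduler granted an increase; remember it until the NM confirms enforcement.
            void onIncreaseGranted(ContainerId id, Resource delta) {
              pending.put(id, new Pending(delta, System.currentTimeMillis()));
            }

            // NM confirmed the increase via heartbeat: stop tracking it.
            void onIncreaseConfirmed(ContainerId id) {
              pending.remove(id);
            }

            // Periodically called: drop grants the NM never confirmed so the scheduler can reclaim them.
            void expire(long nowMs) {
              pending.entrySet().removeIf(e -> nowMs - e.getValue().grantedAtMs > EXPIRY_MS);
            }
          }
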
          Sandy Ryza added a comment -

          Going through RM directly is better as the RM will immediately know that the resource is available for future allocations

          Is the idea that the RM would make allocations using the space before receiving acknowledgement from the NodeManager that it has resized the container (adjusted cgroups)?

          Vinod Kumar Vavilapalli added a comment -

          We all agreed that due to the complexity of the current design, it is worthwhile to revisit the idea of increasing and decreasing container size both through Resource Manager

          +1 for this idea. Letting this go through the NodeManager directly adds too much complexity and difficult-to-understand semantics for application writers.

          If I understand correctly, this would be a considerable hit to performance

          Sandy Ryza, as I understand, going through NM is in fact a worse solution w.r.t allocation throughput. Going through RM directly is better as the RM will immediately know that the resource is available for future allocations - the decrease on the NM can happen offline. The control flow I expect is

          • the framework/app decides it doesn't need that many resources anymore. By this time, the container already should have given up on the physical resources it doesn't need
          • informs the RM about the required decrement
          • RM informs NM to resize the container (cgroups etc)
          Wangda Tan added a comment -

          Sparking->Spark

          Wangda Tan added a comment -

          Sandy Ryza,
          Thanks for coming back .
          I'm not very sure about what's the performance issue you mentioned if decreases goes to RM, what's the expected (ideal) delay in your mind of Sparking releasing resource.

          Sandy Ryza added a comment -

          Sorry, I've been quiet here for a while, but I'd be concerned about a design that requires going through the ResourceManager for decreases. If I understand correctly, this would be a considerable hit to performance, which could be prohibitive for frameworks like Spark that might use container resizing for allocating per-task resources.

          MENG DING added a comment -

          Had a very good discussion with Wangda Tan at the Hadoop summit. We all agreed that, due to the complexity of the current design, it is worthwhile to revisit the idea of increasing and decreasing container size both through the Resource Manager; that would at least eliminate the need for token expiration logic, and also eliminate the need for the AM-NM protocol and APIs. I am currently working on the new design, and will post it for review when it is ready.

          Wangda Tan added a comment -

          MENG DING, I just added you to the contributor list; you can go ahead and assign JIRAs to yourself.

          MENG DING added a comment -

          Just an update, I am currently working on:

          YARN-1449, API in NM side to support change container resource
          YARN-1643, ContainerMonitor changes in NM
          YARN-1510, NMClient

          I will append patches and drive discussions in each ticket.

          MENG DING added a comment -

          Just wanted to add that if dominant resource calculator is being used, it may compare different dimensions between target and current resource, but since we have the restriction that all dimensions must be >= or <= for increase/decrease actions, there should be no conflicting results.

          Wangda Tan made changes -
          Wangda Tan added a comment -

          Moved the old design docs / patches into a zip file to avoid unnecessary noise.

          Wangda Tan made changes -
          Attachment yarn-server-resourcemanager.patch.ver.1 [ 12615533 ]
          Wangda Tan made changes -
          Attachment yarn-server-nodemanager.patch.ver.1 [ 12615532 ]
          Wangda Tan made changes -
          Attachment yarn-server-common.patch.ver.1 [ 12615531 ]
          Wangda Tan made changes -
          Attachment yarn-pb-impl.patch.ver.1 [ 12615530 ]
          Wangda Tan made changes -
          Attachment yarn-api-protocol.patch.ver.1 [ 12615529 ]
          Wangda Tan made changes -
          Attachment yarn-1197-v5.pdf [ 12617834 ]
          Wangda Tan made changes -
          Attachment yarn-1197-v4.pdf [ 12614261 ]
          Wangda Tan made changes -
          Attachment yarn-1197-v3.pdf [ 12607745 ]
          Wangda Tan made changes -
          Attachment yarn-1197-v2.pdf [ 12606770 ]
          Wangda Tan made changes -
          Attachment yarn-1197-scheduler-v1.pdf [ 12618425 ]
          Wangda Tan made changes -
          Attachment yarn-1197.pdf [ 12604489 ]
          Wangda Tan made changes -
          Attachment tools-project.patch.ver.1 [ 12615528 ]
          Wangda Tan made changes -
          Attachment mapreduce-project.patch.ver.1 [ 12615527 ]
          MENG DING added a comment -

          Wangda Tan
          Makes sense to me. Will update the doc to include this.

          Wangda Tan added a comment -

          MENG DING,
          For the comparison of resources, I think for both increase/decrease it should be >= or <= across all dimensions. But if the default resource calculator is used, increasing v-cores makes no sense. So I think the ResourceCalculator has to be used, but we also need to check all individual dimensions.

          So the logic will be:

          if (increase):
              delta = target - now
              // no individual dimension may shrink
              if (delta.mem < 0 || delta.vcore < 0):
                 throw exception
              // and the delta must be a real increase overall
              if (resourceCalculator.lessOrEqualThan(delta, 0)):
                 throw exception
              // ... move forward
          
          MENG DING added a comment -

          Correcting a typo in my previous post; it should be:

          As an example, if a container is currently using 2G, and AM asks to increase its resource to 4G, and then asks again to increase to 6G, but AM doesn't actually use any of the tokens to increase the resource on NM. In this case, with the current design, RM can only revert the resource allocation back to 4G after expiration, not 2G.

          Forgot to discuss another important piece. We probably should not use the existing ResourceCalculator to compare two resource capabilities in this project, because:

          • The DefaultResourceCalculator only compares memory, which won't work if we want to only change CPU cores.
          • The DominantResourceCalculator may end up comparing different dimensions between two Resources, which doesn't make sense in our project.

          The way to compare two resources in this project should be straightforward, as follows (a small sketch is given after the list). Let me know if you think otherwise.

          • For increase request, no dimension in the target resource can be smaller than the corresponding dimension in the current resource, and at least one dimension in the target resource must be larger than the corresponding dimension in the current resource.
          • For decrease request, no dimension in the target resource can be larger than the corresponding dimension in the current resource, and at least one dimension in the target resource must be smaller than the corresponding dimension in the current resource.
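
          A minimal sketch of this comparison rule over the memory and vcores dimensions (the method names below are illustrative only, not part of any posted patch):

          // Valid increase: no dimension shrinks, and at least one dimension grows.
          static boolean isValidIncrease(Resource current, Resource target) {
            boolean noneSmaller = target.getMemory() >= current.getMemory()
                && target.getVirtualCores() >= current.getVirtualCores();
            boolean someLarger = target.getMemory() > current.getMemory()
                || target.getVirtualCores() > current.getVirtualCores();
            return noneSmaller && someLarger;
          }

          // Valid decrease: the mirror image of the increase check.
          static boolean isValidDecrease(Resource current, Resource target) {
            return isValidIncrease(target, current);
          }
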
          MENG DING added a comment -

          Thanks Vinod Kumar Vavilapalli and Wangda Tan for the great comments!

          To Vinod Kumar Vavilapalli:

          Expanding containers at ACQUIRED state sounds useful in theory. But agree with you that we can punt it for later.

          Thanks for the confirmation

          To your example of concurrent increase/decrease sizing requests from AM, shall we simply say that only one change-in-progress is allowed for any given container?

          Actually we really wanted to be able to achieve this, but with the current asymmetric logic of increasing resources through RM and decreasing resources through NM, it doesn't seem to be possible. The reasons are:

          • The increase action starts from AM requesting the increase from RM, being granted a resource increase token, then initiating the increase action on NM, until finally NM confirming with RM about the increase.
          • Once an increase token has been granted to AM, and before it expires (10 minutes by default), if AM does not initiate the increase action on NM, NM will have no idea that an increase is already in progress.
          • If, at this moment, AM initiates a resource decrease action on NM, NM will go ahead and honor it. So in effect, there can be concurrent decrease/increase action going on, and there doesn't seem to be a way to block this.

          If we do the above, this will also simplify most of the code, as we will simply have the notion of a Change, instead of an explicit increase/decrease everywhere. For e.g., we will just have a ContainerResourceChangeExpirer.

          I believe the ContainerResourceChangeExpirer only applies to the container resource increase action. The container decrease action goes directly through NM so it does not need an expiration logic.

          There will be races with container-states toggling from RUNNING to finished states, depending on when AM requests a size-change and when NMs report that a container finished. We can simply say that the state at the ResourceManager wins.

          Agreed.

          Didn't understand why we need this RM-NM confirmation. The token from RM to AM to NM should be enough for NM to update its view, right?

          This is the same as the reasons listed above.

          Instead of adding new records for ContainerResourceIncrease / decrease in AllocationResponse, should we add a new field in the API record itself stating if it is a New/Increased/Decreased container? If we move to a single change model, it's likely we will not even need this.

          I am open to this suggestion. We could add a field to the existing ContainerProto to indicate whether this Container is a new/increased/decreased container. The only thing I am not sure about is whether we can still change the AllocateResponseProto now that ContainerResourceIncrease/Decrease is already in trunk.

          Any obviously invalid change-requests should be rejected right-away. For e.g, an increase to more than cluster's max container size. Seemed like you are suggesting we ignore the invalid requests.

          Agreed that any invalid increase requests from AM to RM, and invalid decrease requests from AM to NM should be directly rejected. The 'ignore' case I was referring to is in the context of NodeUpdate from NM to RM.

          Nit: In the design doc, the high-level flow for container-increase point #7 incorrectly talks about decrease instead of increase.

          Yes, this is a mistake, and I will correct it.

          I propose we do this in a branch

          Definitely. There is already a YARN-1197 branch, and we can simply work in that branch.

          To Wangda Tan:

          Actually the approach in the design doc is this (Meng, please let me know if I misunderstood). In the scheduler's implementation, only one pending change request is allowed for the same container; a later change request will either overwrite the prior one or be rejected.

          The current design only allows one outstanding increase request per container, which is guaranteed by the ContainerResourceIncreaseExpirer object. However, as explained above, we cannot block a decrease action while an increase action is still in progress.

          1) For the protocols between servers/AMs, mostly same to previous doc, the biggest change I can see is the ContainerResourceChangeProto in NodeHeartbeatResponseProto, which makes sense to me.

          Yes, the ContainerResourceChangeProto is the biggest change. Glad that you agree with this new protocol

          2) For the client side change: 2.2.1, +1 to option 3.

          Great. I will remove option 1 and option 2 from the design doc.

          3) For 2.3.3.2 scheduling part, The scheduling of an outstanding resource increase request to a container will be skipped if there are either:. Both of the two may not be needed, since the AM can ask for more resources while a container increase is in progress (e.g. the container was increased to 4G, and the AM wants it to be 6G before notifying the NM).

          Good point, this could be very convenient in practice. However the thing that I have not figured out is how to handle the increase token expiration logic if we have multiple increase actions going on at the same time. The current expiration logic (section 2.3.2 in the design doc) only tracks one increase request for a container (container ID + original capacity for rollback). As an example, if AM is currently using 2G, and asks to increase to 4G, and then asks again to increase to 6G, but AM doesn't actually use any of the token to increase the resource on NM. In this case, RM can only revert the resource allocation back to 4G after expiration, not 2G.

          4) We may not need a "reserved increase request"; all increase requests should be considered "reserved". But we still need to respect the order of applications in the LeafQueue, no matter whether it's the original FIFO or Fair (added after YARN-3306). We can discuss more scheduling details in separate JIRAs.

          For sure. My knowledge in the scheduler side is still very limited, so I will continue to learn along the way.

          By the way, thanks for cleaning up the JIRAs. It's great that you are able to work on the RM/Scheduler! I am glad to take any unassigned tasks.

          Wangda Tan added a comment -

          Short summary about past works to make sure they will not be wasted:

          • https://issues.apache.org/jira/secure/attachment/12618222/yarn-1449.5.patch contains the changes of YARN-1449 and YARN-1643. They can very likely be reused.
          • https://issues.apache.org/jira/secure/attachment/12619072/yarn-1502.2.patch contains the changes of YARN-1646 and YARN-1651. They can very likely NOT be reused.

          Wangda Tan added a comment -

          Hi MENG DING,
          I just completed a cleanup of the sub JIRAs. I think some of them were too fine-grained. For example, it will be very hard to split work in FiCaSchedulerNode/FiCaSchedulerApp from changes in LeafQueue/ParentQueue. The following are the JIRAs after cleanup:

          API:
          YARN-1449, API in NM side to support change container resource.
          YARN-1502, API changes in RM side to support change container resource.

          Client:
          YARN-1509, AMRMClient
          YARN-1510, NMClient

          NM implementation
          YARN-1643, ContainerMonitor changes in NM

          RM implementation
          YARN-1646, Support change container resource in RM.
          YARN-1651, CapacityScheduler side changes.

          I unassigned myself from many of these JIRAs, but I still plan to implement the changes on the RM/CS side. For the other JIRAs, please feel free to take them.

          Wangda Tan added a comment -

          Thanks to MENG DING for thinking this through and extending it into a thorough design doc, and to Vinod Kumar Vavilapalli for the review. I would really like to see this move forward.

          To Vinod's comment:

          Didn't understand why we need this RM-NM confirmation. The token from RM to AM to NM should be enough for NM to update its view, right?

          This is to make sure the RM/NM are synchronized; one example is mentioned in https://issues.apache.org/jira/browse/YARN-1197?focusedCommentId=14559284&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-14559284. In this design, NM/RM communication is two-way, so the RM needs to acknowledge changes to the NM so that the NM can update the container monitoring status locally and avoid inconsistencies.

          To your example of concurrent increase/decrease sizing requests from AM, shall we simply say that only one change-in-progress is allowed for any given container?

          Actually the approach in the design doc is this (Meng, please let me know if I misunderstood). In the scheduler's implementation, only one pending change request is allowed for the same container; a later change request will either overwrite the prior one or be rejected.

          Some feedback on the design doc so far:
          1) For the protocols between servers/AMs, mostly same to previous doc, the biggest change I can see is the ContainerResourceChangeProto in NodeHeartbeatResponseProto, which makes sense to me.
          2) For the client side change: 2.2.1, +1 to option 3.
          3) For 2.3.3.2 scheduling part, {{The scheduling of an outstanding resource increase request to a container will be skipped if there are either:}}. Both of the two may not be needed, since the AM can ask for more resources while a container increase is in progress (e.g. the container was increased to 4G, and the AM wants it to be 6G before notifying the NM).
          4) We may not need a "reserved increase request"; all increase requests should be considered "reserved". But we still need to respect the order of applications in the LeafQueue, no matter whether it's the original FIFO or Fair (added after YARN-3306). We can discuss more scheduling details in separate JIRAs.

          I will clean up the subtasks (some of them are too detailed to me, especially the scheduler-internal changes). Will post once I finish.

          Vinod Kumar Vavilapalli added a comment -

          Tx for taking this up MENG DING!

          Read your updated doc. Looks good overall. Pretty comprehensive, great work!

          Some comments

          Major

          • Expanding containers at ACQUIRED state sounds useful in theory. But agree with you that we can punt it for later.
          • To your example of concurrent increase/decrease sizing requests from AM, shall we simply say that only one change-in-progress is allowed for any given container?
          • If we do the above, this will also simplify most of the code, as we will simply have the notion of a Change, instead of an explicit increase/decrease everywhere. For e.g., we will just have a ContainerResourceChangeExpirer.
          • There will be races with container-states toggling from RUNNING to finished states, depending on when AM requests a size-change and when NMs report that a container finished. We can simply say that the state at the ResourceManager wins.
          • After processing all resource change messages for a container in a node update, RM will set the current resource allocation known by RM for this container in the next node heartbeat response, so that NM will (eventually) have the same view of the resource allocation of this container with RM, and monitor/enforce accordingly.

            Didn't understand why we need this RM-NM confirmation. The token from RM to AM to NM should be enough for NM to update its view, right?

          Minor

          • Instead of adding new records for ContainerResourceIncrease / decrease in AllocationResponse, should we add a new field in the API record itself stating if it is a New/Increased/Decreased container? If we move to a single change model, it's likely we will not even need this.
          • Any obviously invalid change-requests should be rejected right-away. For e.g, an increase to more than cluster's max container size. Seemed like you are suggesting we ignore the invalid requests.
          • Nit: In the design doc, the high-level flow for container-increase point #7 incorrectly talks about decrease instead of increase.

          Just caught up with the rest of your discussion w.r.t. decreasing container sizes. The feature is useful outside of JVM processes - C code, servers managing their data off-heap, etc. - so we can continue working on it.

          Process

          I propose we do this in a branch. We got a couple of patches in earlier from Wangda Tan and then the feature was unfortunately dropped on the floor. A branch helps avoid this going forward.

          MENG DING made changes -
          Attachment YARN-1197_Design.pdf [ 12735345 ]
          MENG DING added a comment -

          We have come up with a new proposal to address the above problem, which we believe makes more sense.

          In essence, when RM receives a valid resource decrease message for a container, it should go ahead and honor it directly. If later on an increase message comes for the same container, and the target resource allocation is different from the current resource allocation known by RM, this increase action should be cancelled. To cancel this increase action, RM can simply use the NM-RM node heartbeat response to send the current resource allocation of this container known by RM back to NM. NM can use this value to monitor and enforce the resource usage of the container. In fact, we propose that after RM processes the resource change messages from NM in each node update heartbeat, it should simply set the final resource allocation of those containers that have just been re-sized in the heartbeat response, so that NM will have the same view of the resource allocation of those changed containers as RM (a small sketch follows the example below).

          Back to the original example:

          1. A container is currently using 6G
          2. AM asks RM to increase it to 8G
          3. RM grants the increase request, allocates the resource to the container to 8G, and issues a token to AM. It starts a timer and remembers the original resource allocation before the increase as 6G.
          4. AM, instead of initiating the resource increase to NM, requests a resource decrease to NM to decrease it to 4G
          5. The decrease is successful and RM gets the notification, and updates the container resource to 4G
          6. Before the token expires, the AM requests the resource increase to NM
          7. RM receives the resource increase message (8G) from the node update. However, the current resource allocation of this container is 4G, which is different from 8G, so RM will NOT consider this increase valid. It unregisters the increase request from the timer, and sets the current resource allocation (4G) in the node heartbeat response, which will be pulled by NM in the next heartbeat cycle.
          8. Once NM receives the 4G resource allocation, it will monitor and enforce using the 4G value.
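
          A minimal sketch of the RM-side check in step 7, under the assumption of hypothetical helper names (the expirer and heartbeat-response methods below are illustrative, not the actual APIs):

          // Called while processing a container increase report from a node update.
          void onIncreaseReported(RMContainer container, Resource reportedTarget) {
            containerIncreaseExpirer.unregister(container.getContainerId()); // stop the expiration timer
            Resource current = container.getAllocatedResource();
            if (!reportedTarget.equals(current)) {
              // The container was resized (e.g. decreased) after the token was granted,
              // so the reported increase is stale: echo RM's current view back to the NM.
              heartbeatResponse.addContainerToUpdate(container.getContainerId(), current);
            }
            // otherwise the increase matches RM's bookkeeping and nothing else is needed
          }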

          We have finished updating the design doc and have attached it to this thread (YARN-1197_Design.pdf). Many thanks to Tan, Wangda for the original design doc, which really helped us to catch up with all the discussion as quickly as we can. We hope to get your valuable feedback soon.

          We think most of the sub-tasks are still good as outlined in the updated design doc. Once we get approval of the design from the community, we will start the implementation.

          MENG DING added a comment -

          So to summarize the current dilemma:

          Situation:

          • A container resource increase request has been granted, and a token has been issued to AM, and
          • The increase action has not been fulfilled, and the token is not expired yet

          Problem:

          • AM can initiate a container resource decrease action to NM, and NM will fulfill it and notify RM, and then
          • Before the token expires, AM can still initiate a container resource increase action on NM with the token, and NM will fulfill it and notify RM

          Proposed solution:

          • When RM receives a container decrease message from NM, it will first check if there is an outstanding container increase action (by checking the ContainerResourceIncreaseExpirer)
          • If the answer is no, RM will go ahead and update its internal resource bookkeeping and reduce the container resource allocation for this container.
          • If the answer is yes, RM will skip the resource reduction in this cycle, keep the resource decrease message in its newlyDecreasedContainers data structure, and check again in the next NM-RM heartbeat cycle.
          • If in the next heartbeat, a resource increase message to the same container comes, the previous resource decrease message will be dropped.

          Not sure if there is a better solution to this problem. Let me know if this makes sense or not.

          Thanks,
          Meng

          MENG DING added a comment -

          Well, I think I spoke too soon

          The example I gave above is not entirely correct:

          1. A container is currently using 6G
          2. AM asks RM to increase it to 8G
          3. RM grants the increase request, allocates the resource to the container to 8G, and issues a token to AM. It starts a timer and remembers the original resource allocation before the increase as 6G.
          4. AM, instead of initiating the resource increase to NM, requests a resource decrease to NM to decrease it to 4G
          5. The decrease is successful and RM gets the notification, and updates the container resource to 4G
          6. Before the token expires, the AM requests the resource increase to NM
          7. The increase is successful and RM gets the notification, and updates the container resource back to 8G

          Steps 6 and 7 should not be allowed because the RM has already reduced the container resource to 4G, which effectively invalidated the previously granted increase request (8G), even though the token has not yet expired.

          MENG DING added a comment -

          Wangda Tan I totally agree that Yarn should not mess with Java Xmx. Sorry for not being clear before.

          While digging into the design details of this issue, there is (I think) a piece that is missing from the original design doc, which I hope to get some insights/clarifications from the community:

          It seems there is no discussion about the container resource increase token expiration logic.

          Here is what I think that should happen:

          1. AM sends a container resource increase request to RM.
          2. RM grants the request, allocating the additional resource to the container, updating its internal resource bookkeeping.
          3. During the next AM-RM heartbeat, RM pulls the newly increased container, creates a token for it and sets the token in the allocation response. In the meantime, RM starts a timer for this granted increase (e.g., register with ContainerResourceIncreaseExpirer).
          4. AM acquires the container resource increase token from the heartbeat response, then calls the NMClient API to launch the container resource increase action on NM.
          5. NM receives the request, increases the monitoring quota of the container, and notifies the NodeStatusUpdater.
          6. The NodeStatusUpdater informs the increase success to RM during regular NM-RM heartbeat.
          7. Upon receiving the increase success message, the RM stops the timer (e.g, unregister with ContainerResourceIncreaseExpirer).

          If, however, the timer in RM expires and no increase success message has been received for this container, RM must release the increased resource granted to the container, and update its internal resource bookkeeping.

          As such, NM-RM heartbeat must also include container resource increase message (which doesn't exist in the original design), otherwise the expiration logic will not work.

          In addition, RM must remember the original resource allocation to the container (this info may be stored in the ContainerResourceIncreaseExpirer), because in the case of expiration, RM needs to release the increased resource and revert back to the original resource allocation. This is different from a newly allocated container, in which case, RM simply needs to release the resource for the entire container when it expires.
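
          A minimal sketch of that bookkeeping, with illustrative names (the rollback hook in particular is hypothetical):

          // Remember the allocation at the time the increase token is handed out ...
          private final Map<ContainerId, Resource> allocationBeforeIncrease = new ConcurrentHashMap<>();

          void onIncreaseAcquired(ContainerId id, Resource currentAllocation) {
            allocationBeforeIncrease.put(id, currentAllocation);
          }

          // ... and revert to it if the NM never confirms the increase before expiration.
          void onIncreaseExpired(ContainerId id) {
            Resource original = allocationBeforeIncrease.remove(id);
            if (original != null) {
              revertContainerAllocation(id, original); // hypothetical scheduler hook
            }
          }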

          To make matters more complicated, after a container resource increase token has been given out, and before it expires, there is no guarantee that AM won't issue a resource decrease action on the same container. Because the resource decrease action starts from NM, NM has no idea that a resource increase token on the same container has been issued, and that a resource increase action could happen anytime.

          Given the above, here is what I propose to simplify things as much as we can without compromising the main functionality:

          At the RM side
          1. During each scheduling, if RM finds that there are still granted container resource increase sitting in RM (i.e., not yet acquired by AM), it will skip scheduling any outstanding resource increase request to the same container.
          2. During each scheduling, if RM finds that there is a granted container resource increase registered with ContainerResourceIncreaseExpirer, it will skip scheduling any outstanding resource increase request to the same container.

          This will guarantee that at any time, there can be one and only one resource increase request for a container.

          At the NM side
          1. Create a map to track any resource increase or decrease action for a container in NMContext. At any time, there can only be either an increase action or a decrease action going on for a specific container. While an increase/decrease action is in progress in NM, any new request from AM to increase/decrease resource to the same container will be rejected (with proper error messages).
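
          A minimal sketch of such a guard in NMContext (the enum and method names are illustrative, not from any posted patch):

          enum ResourceChangeAction { INCREASE, DECREASE }

          // At most one resize action may be in flight per container.
          private final ConcurrentMap<ContainerId, ResourceChangeAction> changeInProgress =
              new ConcurrentHashMap<>();

          boolean tryStartChange(ContainerId id, ResourceChangeAction action) {
            // putIfAbsent returns null only when no other change is currently in progress
            return changeInProgress.putIfAbsent(id, action) == null;
          }

          void finishChange(ContainerId id) {
            changeInProgress.remove(id);
          }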

          With the above logic, here is an example of what could happen:

          1. A container is currently using 6G
          2. AM asks RM to increase it to 8G
          3. RM grants the increase request, allocates the resource to the container to 8G, and issues a token to AM. It starts a timer and remembers the original resource allocation before the increase as 6G.
          4. AM, instead of initiating the resource increase to NM, requests a resource decrease to NM to decrease it to 4G
          5. The decrease is successful and RM gets the notification, and updates the container resource to 4G

          After this, two possible sequences may occur:

          6. Before the token expires, the AM requests the resource increase to NM
          7. The increase is successful and RM gets the notification, and updates the container resource back to 8G

          Or,

          6. AM never sends the resource increase to NM
          7. The token expires in RM. RM attempts to revert the resource increase (i.e., set the resource allocation back to 6G), but seeing that it is currently using 4G, it will do nothing.

          This is what I have for now. Sorry for the long email, and I hope that I have made myself clear. Please let me know if I am on the right track. Any comments/corrections/suggestions/advice will be extremely appreciated.

          Meng

          Wangda Tan added a comment -

          I agree with what Karthik Kambatla mentioned; increasing Xmx doesn't seem like a very good idea. I think we should treat the JVM as a black box and not try to hack it from YARN's perspective. It's fine if the user's application does the Xmx adjustment itself to make it shrinkable.

          The reason why only CPU enforcement is supported is that CPU enforcement in LCE is a soft limit, while memory is a hard limit that can lead to process failure when a memory spike happens; you can check the YARN-3 discussion for more details. YARN-2793 is different: it tries to define the behavior of killing an over-used container, not how to enforce the limit.

          Dynamically updating a cgroup is also not supported, but I think we should support it with this ticket.
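
          For reference, a dynamic cgroup update is essentially a rewrite of the container's limit file. A minimal sketch, assuming cgroups v1 and the default hadoop-yarn hierarchy, and assuming a memory hierarchy were actually mounted for containers (which, as noted above, LCE does not set up today):

          // Rewrite the memory limit of a running container's cgroup (cgroups v1).
          void updateMemoryLimit(String containerId, long newLimitBytes) throws IOException {
            java.nio.file.Path limitFile = java.nio.file.Paths.get(
                "/sys/fs/cgroup/memory/hadoop-yarn/" + containerId + "/memory.limit_in_bytes");
            java.nio.file.Files.write(limitFile,
                Long.toString(newLimitBytes).getBytes(java.nio.charset.StandardCharsets.UTF_8));
          }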

          MENG DING added a comment -

          Thanks guys for the comments.

          Yes, I believe a memory enforcement option for LCE is definitely a desirable feature and the proper way to handle memory enforcement for long running services. It looks like YARN-2793 is related, and YARN-3 already has a patch for this?

          Then we also need the capability to dynamically update the cgroup that a process runs under, which I believe is not supported today either, right?

          Karthik Kambatla added a comment -

          We thought about launching the JVM based container with -Xmx set to the physical memory of the node, and use cgroup memory control to enforce the resource limit, but we don't think LCE supports memory isolation right now . We cannot use YARN's default memory enforcement as we don't want long running services to be killed.

          A JVM with a larger value for Xmx will likely be less aggressive with GC. Any resultant increase in heap size might or might not be a good thing. If you think this is something viable that people care about, we could consider adding a memory-enforcement option to LCE.

          Wangda Tan added a comment -

          MENG DING,
          Thanks for your interest in this ticket, some comments:

          For JVM based containers (e.g., container running HBase), it is not possible right now to change the heap size of JVM without restarting the Java process. Even if we can implement a wrapper in the container to relaunch a Java process when resource is changed for a container, we still need to implement an interface between node manager and container to trigger the relaunch action.

          Good point, this is one thing we noted as well. I don't think there's any easy solution for shrinking a JVM. Relaunching the container could be one method, but it will be hard to make a generic "container wrapper" since kill-and-relaunch will lose the data in memory.

          But since shrinking memory is a proactive action, when a process wants to shrink its resources, it can use its own "container wrapper" to relaunch the process if it has some data recovery mechanism.

          MENG DING added a comment -

          We have a real use case to better support long running services on YARN, and to share resources between long running services and batch jobs. We have carefully reviewed the discussions and documentation in this thread (and other topics related to this thread), and are committed to bringing this work to completion. We agree with the general design of this feature, and understand that it is the result of extensive discussions among many experts.

          We will attempt to post an updated design shortly for review.

          We don't really see a bottleneck at the scheduler side at this moment. However, we do see problems with memory enforcement for long running services.

          • For JVM-based containers (e.g., a container running HBase), it is not possible right now to change the heap size of a JVM without restarting the Java process. Even if we implement a wrapper in the container to relaunch the Java process when the container's resource is changed, we still need an interface between the node manager and the container to trigger the relaunch.
          • We thought about launching a JVM-based container with -Xmx set to the physical memory of the node and using cgroup memory control to enforce the resource limit, but we don't think LCE supports memory isolation right now. We cannot use YARN's default memory enforcement because we don't want long-running services to be killed.

          So overall there doesn't seem to be an easy way to enforce memory limits without killing long-running services right now. Any comments or suggestions will be greatly appreciated.

          Thanks,
          Meng

          Jeff Zhang made changes -
          Assignee Jeff Zhang [ zjffdu ]
          Hide
          Wangda Tan added a comment -

          I'm glad if anybody can continue this work.

          Since it will potentially take a huge effort to complete, I think we need the following to keep this moving:

          • High-level discussion: since the design doc, patches and tasks were created about a year ago, some of them need rethinking/amendment and some are completely stale. For example, are there any actual use cases for this JIRA? Do any downstream projects plan to consume these APIs? Is there an alternative ticket with a similar proposal (like YARN-1488)?
          • Implementation: after the high-level discussion, I think we can plan the implementation; there are a bunch of sub-tasks I created for this ticket. If you're interested in any of them, just let me know and I can assign them to you. If you don't agree with the task coverage/granularity, please feel free to create a new ticket and I can close the original one.

          And I think the umbrella JIRA should be left unassigned. Jeff Zhang, if you're not working on it, could you unassign it?

          Thoughts?

          Thanks,
          Wangda

          Hide
          Tsuyoshi Ozawa added a comment -

          Wangda Tan, could you review your tasks and unassign the ones you haven't started working on? This feature is useful for some YARN applications (e.g. Spark and Tez).

          Hide
          Chen He added a comment -

          I can also contribute some time on this JIRA.

          Jeff Hammerbacher made changes -
          Link This issue relates to SPARK-3174 [ SPARK-3174 ]
          Hide
          Karthik Kambatla (Inactive) added a comment -

          I am interested in working on this. Depending on the progress, I'll be glad to write patches or review them.

          Jeff Zhang made changes -
          Assignee Jeff Zhang [ zjffdu ]
          Hide
          Wangda Tan (No longer used) added a comment -

          I'm leaving my current company next week and am no longer involved in YARN-1197; one of my colleagues will take over this JIRA and its sub-tasks.

          Wangda Tan (No longer used) made changes -
          Assignee Wangda Tan [ gp.leftnoteasy ]
          Hide
          Wangda Tan added a comment -

          I attached patches for the NM-side changes: YARN-1643, YARN-1644, YARN-1645. Can someone give them a review? Thanks!
          Bikas Saha Sandy Ryza Vinod Kumar Vavilapalli

          Hide
          Wangda Tan added a comment -

          I created a bunch of sub-tasks for easier review:
          NM-side changes are YARN-1643, YARN-1644 and YARN-1645, instead of the big task YARN-1449; I'll work on them first.
          RM-side changes are YARN-1646, YARN-1647, YARN-1648, YARN-1649, YARN-1650, YARN-1651, YARN-1652, YARN-1653 and YARN-1654, instead of the big task YARN-1502. These sub-tasks will add container resource change support to the capacity scheduler and change the corresponding implementations on the RM side; I'll break down the existing patch in YARN-1502 and submit it after the NM changes are completed.
          YARN-1655 will add support to the fair scheduler; I plan to work on this after the capacity scheduler changes are completed.

          Hide
          Wangda Tan (No longer used) added a comment -

          Thanks Sandy, I'm working on breaking down the RM patch and will create sub-JIRAs for easier review.

          Hide
          Sandy Ryza added a comment -

          Created a YARN-1197 branch. Will revert the commits in trunk and branch-2 soon.

          Hide
          Arun C Murthy added a comment -

          I'd appreciate that, thanks Sandy Ryza!

          Hide
          Sandy Ryza added a comment -

          Arun C Murthy, any progress on the branch? If not, I'd be happy to take care of it.

          Hide
          Wangda Tan (No longer used) added a comment -

          Arun C Murthy, great, thanks

          Hide
          Arun C Murthy added a comment -

          Thanks Tan, Wangda, glad we agree. I'll prepare a branch and move the commits there. Thanks again for your contributions and for being so flexible.

          Hide
          Wangda Tan (No longer used) added a comment -

          Copying the text from the scheduler design doc here for easier discussion; please feel free to let me know your comments!

          Basic Requirements
          We need to support handling resource increase requests from the AM and resource decrease notifications from the NM:

          • Such resource changes should be reflected in FiCaSchedulerNode/App, LeafQueue and ParentQueue (e.g. usedResource, reservedResource, etc.)
          • If a user submits an increase request that cannot be satisfied immediately, it will be reserved in the node/app (node/app means FiCaSchedulerApp/Node, same below), as before.

          Advanced Requirements

          • We need to gracefully handle race conditions (a small bookkeeping sketch follows this list):
            • Only acquired/running containers can be increased
            • A container decrease only takes effect on acquired/running containers. (If a container is finished/killed, etc., all of its resource is released, so we don't need to decrease it)
            • A user may submit a new increase request for a container while a pending increase request for the same container exists. We need to replace the pending request with the new one.
            • When the requested container resource is less than or equal to the existing container resource:
              • The request is ignored if there is no pending increase request for this container
              • Otherwise the request is ignored and the pending increase request is canceled
            • When a pending increase request exists and a decrease notification arrives for the same container, the container is decreased and the pending increase request is canceled
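
          An illustrative sketch of the bookkeeping rules above (not taken from any patch attached here); the class name, the per-container map, and the use of a plain memory value in place of a Resource object are assumptions made only for illustration.

              import java.util.HashMap;
              import java.util.Map;

              public class PendingIncreaseRequests {
                  // containerId -> requested memory in MB (stand-in for a full Resource object).
                  private final Map<String, Long> pending = new HashMap<>();

                  // Apply an AM increase ask against the container's current size.
                  public synchronized void onIncreaseAsk(String containerId, long askedMb, long currentMb) {
                      if (askedMb <= currentMb) {
                          // Too small: ignore the ask and cancel any pending increase request.
                          pending.remove(containerId);
                          return;
                      }
                      // A newer ask replaces whatever was pending for the same container.
                      pending.put(containerId, askedMb);
                  }

                  // A decrease notification from the NM cancels any pending increase.
                  public synchronized void onDecrease(String containerId) {
                      pending.remove(containerId);
                  }

                  public synchronized Long getPending(String containerId) {
                      return pending.get(containerId);
                  }
              }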

          Requirements not clear

          • Do we need a time-out parameter for a reserved resource increase request, to avoid it occupying the node's resources for too long? (Do we have such a parameter for reserving a "normal" container in CS?)
          • How do we decide whether an increase request or a normal container request is satisfied first? (Currently, I simply make CS satisfy increase requests first.) Should this be a configurable parameter?

          Current Implementation

          1) Decrease Container
          I start with container decrease because it is easier to understand.
          Decreased containers will be handled in nodeUpdate() of the capacity scheduler.
          When CS receives decreased containers from the NM, it processes them one by one with the following steps (a rough sketch follows the list):

          • Check if the container is in the RUNNING state (because this is reported by the NM, its state will be either running or completed); skip it if not.
          • Remove any increase request on the same container id, if one exists
          • Decrease/update the container resource in FiCaSchedulerApp/AppSchedulingInfo/FiCaSchedulerNode/LeafQueue/ParentQueue/other related metrics
          • Update the resource in the Container object.
          • Return the decreased container to the AM by calling setDecreasedContainer in AllocateResponse
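
          A rough sketch of the decrease path above, driven from the scheduler's node update; the types are simplified stand-ins, and the queue/app/node accounting and token updates are left as stubs rather than the real YARN classes.

              import java.util.List;

              public class DecreaseHandler {
                  static class DecreasedContainer {
                      String containerId;
                      long newMemoryMb;
                      boolean running;
                  }

                  public void onNodeUpdate(List<DecreasedContainer> decreasedFromNm) {
                      for (DecreasedContainer c : decreasedFromNm) {
                          if (!c.running) {
                              continue; // finished/killed containers have already released everything
                          }
                          cancelPendingIncrease(c.containerId);  // drop any pending increase request for this container
                          updateSchedulerAccounting(c);          // app/node/queue usedResource and metrics
                          updateContainerResource(c);            // the Container object itself
                          addToAllocateResponse(c);              // report the decreased container back to the AM
                      }
                  }

                  private void cancelPendingIncrease(String containerId) { /* omitted */ }
                  private void updateSchedulerAccounting(DecreasedContainer c) { /* omitted */ }
                  private void updateContainerResource(DecreasedContainer c) { /* omitted */ }
                  private void addToAllocateResponse(DecreasedContainer c) { /* omitted */ }
              }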

          2) Increase Container
          Increasing a container is much more complex than decreasing one.

          Steps to add a container increase request (pseudocode)
          In CapacityScheduler.allocate(...)

              foreach (increase_request):
                  if (state != ACQUIRED) and (state != RUNNING):
                      continue;
          
                  // Remove the old request on the same container-id if it exists
                  if increase_request_exist(increase_request.getContainerId()):
                      remove(increaseRequest);
          
                  // Asked target resource should be larger than the existing resource
                  if increase_request.ask_resource <= 
                  existing_resource(increase_request.getContainerId()):
                      continue;
          
                  // Add it to application
                  getApplication(increase_request.getContainerId()).add(increase_request)
          

          Steps to handle container increase request,
          2.1) In CapacityScheduler.nodeUpdate(...):

              if node.is_reserved():
                  if reserved-increase-request:
                      LeafQueue.assignReservedIncreaseRequest(...)
                  elif reserved-normal-container:
                      ...
              else:
                  ParentQueue.assignContainers(...)
                  // this will finally call 
                  // LeafQueue.assignContainers(...)
          

          2.2) In CapacityScheduler.nodeUpdate(...):

              if request-is-fit-in-resource:
                  allocate-resource
                  update container token
                  add to AllocateResponse
                  return allocated-resource
              else:
                  return None
          

          2.3) In LeafQueue.assignContainers(...):

              foreach (application):
                  // do increase allocation first
                  foreach (increase_request):
                      // check if we can allocate it
                      // in queue/user limits, etc.
                      // return None if not satisfied
          
                      if request-is-fit-in-resource:
                          allocate-resource
                          update container token
                          add to AllocateResponse
                      else:
                          reserve in app/node
                          return reserved-resource
          
                  // do normal allocation
                  ...
          

          API changes in CapacityScheduler
          1) YarnScheduler

             public Allocation allocate(ApplicationAttemptId applicationAttemptId,
                 List<ResourceRequest> ask, List<ContainerId> release,
                 List<String> blacklistAdditions, List<String> blacklistRemovals,
          +    List<ContainerResourceIncreaseRequest> increaseRequests)
          

          2) CSQueue

          +  public void cancelIncreaseRequestReservation(Resource clusterResource,
          +      ContainerResourceIncreaseRequest changeRequest, Resource required);
          +  
          +  public void decreaseContainerResource(FiCaSchedulerApp application,
          +      Resource clusterResource, Resource released);
          

          3) FiCaSchedulerApp

          +  synchronized public List<ContainerResourceIncreaseRequest>
          +      getResourceIncreaseRequest(NodeId nodeId)
          +
          +  synchronized public ContainerResourceIncreaseRequest
          +      getResourceIncreaseRequest(NodeId nodeId, ContainerId containerId);
          +
          +  synchronized public void removeIncreaseRequest(NodeId nodeId,
          +      ContainerId containerId);
          +
          +  synchronized public void decreaseContainerResource(Resource released);
          

          4) FiCaSchedulerNode

          + public synchronized void increaseResource(Resource resource);
          + public synchronized void decreaseContainerResource(Resource resource);
          +  public synchronized void reserveIncreaseResource(
          +      ContainerResourceIncreaseRequest increaseRequest);
          +  public synchronized void unreserveIncreaseResource(ContainerId containerId)
          

          5) AppSchedulingInfo

          +  synchronized public void addIncreaseRequests(NodeId nodeId,
          +      ContainerResourceIncreaseRequest increaseRequest, Resource required)
          +  synchronized public void decreaseContainerResource(Resource released)
          +  synchronized public List<ContainerResourceIncreaseRequest>
          +      getResourceIncreaseRequests(NodeId nodeId)
          +  synchronized public ContainerResourceIncreaseRequest
          +      getResourceIncreaseRequests(NodeId nodeId, ContainerId containerId)
          +  synchronized public void allocateIncreaseRequest(Resource required)
          +  synchronized public void removeIncreaseRequest(NodeId nodeId, 
          +      ContainerId containerId)
          
          Wangda Tan (No longer used) made changes -
          Attachment yarn-1197-scheduler-v1.pdf [ 12618425 ]
          Hide
          Wangda Tan (No longer used) added a comment -

          Attached the scheduler design doc for increasing and decreasing; I've uploaded a rough preview patch for the scheduler changes in YARN-1502

          Hide
          Wangda Tan (No longer used) added a comment -

          My thoughts after reading Bikas and Sandy's comments:
          I think the two JIRAs are tackling different problems, and as Sandy said, container increase/decrease gives the scheduler more input to optimize such increase requests (for example, a user could configure "increase request priority" higher than "normal container request priority", so increase operations are handled faster to support low-latency services).
          And if a user wants to use container delegation instead of container increase just to get more dynamic resources (not shared between users/applications), I have some concerns beyond Sandy's points.

          • If we treat all delegated resources as the same container, why not simply merge them (as I originally proposed in this JIRA)? Merging them will make it easier for us to manage preemption, etc.
          • How do we deal with merging a running container (a container is already running; what should happen if we delegate its resources to another container)? If we can only delegate a container in the ACQUIRED state, we need to deal with the timeout for launching a container before it is taken back by the RM
          • Can we do container "de-delegation"?

          In general, I support using container increase/decrease to handle resource changes within a single application.

          Hide
          Bikas Saha added a comment -

          I am afraid we seem to be mixing things up here. This jira deals with the issue of increasing and decreasing the resources of an allocated container. There are clear use cases for it, as mentioned in previous comments on this jira, e.g. having a long running worker daemon increase and decrease its resources depending on load. We have already discussed at length on this jira how increasing a container's resource is internally no different from requesting an additional container and merging it with an existing container. However, the new container followed by a merge is far more complicated for the user and adds additional complexity to the system (e.g. how to deal with the new container that was merged into the old one). This complexity is in addition to the work it has in common with simply increasing resources on a given container. With respect to the user, asking for a container and being able to increase its resources will give the same effect as asking for many containers and merging them.
          The scenario for YARN-1488 is logically different. That covers the case when an app wants to use a shared service and purchases that service by transferring its own container resource to that shared service that itself is running inside YARN. The consumer app may never need to increase its own container resource. Secondly, the shared service is not requesting an increase in its own container resources. So this jira does not come into the picture at all.

          I believe we have a clear and cleanly separated piece of useful functionality being implemented in this jira. We should go ahead and bring this work to completion and facilitate the creation of long running services in YARN.

          With respect to doing this in a branch: there are new APIs being added here for which functionality does not exist or is not supported yet, and none of that code will get executed until clients actually support doing it or someone writes code against it. So I don't think any of this is going to destabilize the code base. I agree that the scheduler changes are going to be complicated. We can do them at the end, when all the plumbing is in place, and they could be separate jiras for each scheduler. Of course, schedulers would want their own flags to turn this on/off. So it's not clear to me what benefits a branch would bring here, but it would entail the overhead of maintenance and a lack of test automation. Does this mean that every feature addition to YARN needs to be done in a branch? I propose we do this work in trunk and later merge it into branch-2 when we are satisfied with its stability.

          Hide
          Sandy Ryza added a comment -

          Wanted to post some thoughts on this vs. YARN-1488. YARN-1488 proposes that if you receive a container on a node you should be able to delegate it to another container already running on that node, essentially adding the resources of the container you received to the allocation of the running container. This sounds a lot like a resource increase. The differences are that:

          • With the mechanism proposed in this JIRA, the request is explicitly an increase and mentions the container you want to add it to. This allows the scheduler to use special logic for handling increase requests.
          • With the mechanism proposed on YARN-1488, a container can be used to increase the resources of a container from another application.
          • With the mechanism proposed here, after satisfying the increase request, the scheduler is tracking a single larger container, not multiple small ones.

          An advantage of treating an increase request the same as a regular container request is that an application could submit it at the same time. I.e. if I want a single container with as many resources as possible on node X, I can request a number of containers on that node, wait for some time period for allocations to accrue, and then run them all as a single container.

          I think the deciding factor might be how preemption functions here. I.e. what is the preemptable unit - can we preempt a part of a container? Have some thoughts but will try to organize them more before posting here.

          Hide
          Wangda Tan (No longer used) added a comment -

          Sorry I missed the reply from Sandy :-p

          Hide
          Wangda Tan (No longer used) added a comment -

          Hi Vinod,
          Thanks for jumping in; my thoughts on your questions:

          Seems like the control flow is asymmetrical for resource decrease. We directly go to the node first. Is that intended? On first look, that seems fine - decreasing resource usage on a node is akin to killing a container by talking to NM directly.

          Yes, we discussed this in this JIRA (credit to Bikas, Sandy and Tucu); I think decreasing a resource is a similar operation to killing a container

          In such applications that decrease container-resource, will the application first instruct its container to reduce the resource usage and then inform the platform? The reason this is important is that if it doesn't happen that way, the node will either forcefully kill it when monitoring resource usage or change its cgroup immediately, causing the container to swap.

          Yes, I think the AM should notify the NM about this only after it makes sure the resource usage in the container has already been reduced, to avoid the container being killed by the NM.

          I support moving this to a branch so it can be nicely completed before merging it to trunk.

          Hide
          Sandy Ryza added a comment -

          Seems like the control flow is asymmetrical for resource decrease. We directly go to the node first. Is that intended? On first look, that seems fine - decreasing resource usage on a node is akin to killing a container by talking to NM directly.

          This is intentional - we went through a few different flows before settling on this approach. The analogy with killing the container was one of the reasons for this.

          In such applications that decrease container-resource, will the application first instruct its container to reduce the resource usage and then inform the platform? The reason this is important is that if it doesn't happen that way, the node will either forcefully kill it when monitoring resource usage or change its cgroup immediately, causing the container to swap.

          When reducing memory, the application should inform the container process before informing the NodeManager. When only reducing CPU, there will probably be situations where only informing the platform is necessary.

          To avoid branch-rot, we could target a subset, say just the resource-increase changes in the branch and do the remaining work on trunk after merge.

          Sounds reasonable to me.
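
          A hedged sketch of the memory-decrease ordering discussed above, seen from the AM side; the method names are placeholders rather than real YARN client APIs, and only the sequencing is the point.

              public class DecreaseOrderingSketch {
                  void decrease(String containerId, Long newMemoryMb, Integer newVcores) {
                      if (newMemoryMb != null) {
                          // 1. Ask the container process to shrink its usage first (application-specific).
                          askContainerToShrinkMemory(containerId, newMemoryMb);
                      }
                      // 2. Only then report the smaller size so the NM tightens monitoring/cgroups afterwards.
                      //    A CPU-only decrease may be able to skip step 1 and go straight to the platform.
                      notifyNodeManager(containerId, newMemoryMb, newVcores);
                  }

                  void askContainerToShrinkMemory(String containerId, long memoryMb) { /* application-specific */ }
                  void notifyNodeManager(String containerId, Long memoryMb, Integer vcores) { /* AM-NM protocol call */ }
              }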

          Hide
          Vinod Kumar Vavilapalli added a comment -

          I just caught up with this. Well written document, thanks! Some questions:

          • Decreasing resources:
            • Seems like the control flow is asymmetrical for resource decrease. We directly go to the node first. Is that intended? On first look, that seems fine - decreasing resource usage on a node is akin to killing a container by talking to NM directly.
            • In such applications that decrease container-resource, will the application first instruct its container to reduce the resource usage and then inform the platform? The reason this is important is that if it doesn't happen that way, the node will either forcefully kill it when monitoring resource usage or change its cgroup immediately, causing the container to swap.

          Also, I can see that some of the scheduler changes are going to be pretty involved. I'd also vote for a branch. A couple of patches already went in and I'm not even sure we got them right and/or whether they need more revisions as we start making core changes. To avoid branch-rot, we could target a subset, say just the resource-increase changes, in the branch and do the remaining work on trunk after merge.

          Hide
          Wangda Tan (No longer used) added a comment -

          Agreed. I also think the scheduler part needs some time for review; I'll create a JIRA for the scheduler part and upload the patch (updated against YARN-1447 and YARN-1448) and design doc ASAP.

          Hide
          Arun C Murthy added a comment -

          The problem is that we can't ship half of this feature in 2.4 - it's either in or out. So, a branch would be significantly better - it's either in or out for 2.4.

          Hide
          Bikas Saha added a comment -

          There are some plumbing/infra related changes which we could commit to trunk safely. None of that would be executed until some scheduler actually supports this. When that happens we could decide to move the code to branch-2 to target a release. Would prefer that to a branch which would need maintenance.

          Hide
          Arun C Murthy added a comment -

          Sorry, to come in late - I'm +1 for the overall idea/approach.

          However, I feel we still have to work through details on the scheduler side. So, I'd like to see this developed in a branch. This would allow for a full picture to emerge before we commit it to a specific release 2.4 v/s 2.5 etc. Thoughts?

          Wangda Tan (No longer used) made changes -
          Attachment yarn-1197-v5.pdf [ 12617834 ]
          Hide
          Wangda Tan (No longer used) added a comment -

          I attached an updated design doc according to our discussion.

          Hide
          Wangda Tan (No longer used) added a comment -

          Guys, I attached patches for YARN-1448 and YARN-1449 for review (updated according to YARN-1447); hope to get your thoughts.
          Thanks

          Hide
          Wangda Tan (No longer used) added a comment -

          +1 for Junping's idea.
          I'm not sure we can get the low-latency container increases needed by many applications via resource over-commitment.

          Hide
          Junping Du added a comment -

          I think this is related to YARN-1011, which would provide resource over-commitment in YARN and achieve better resource elasticity.

          Junping Du made changes -
          Link This issue relates to YARN-1011 [ YARN-1011 ]
          Wangda Tan (No longer used) made changes -
          Description The current YARN resource management logic assumes resource allocated to a container is fixed during the lifetime of it. When users want to change a resource
          of an allocated container the only way is releasing it and allocating a new container with expected size.
          Allowing run-time changing resources of an allocated container will give us better control of resource usage in application side
          Wangda Tan (No longer used) made changes -
          Description Currently, YARN cannot support merge several containers in one node to a big container, which can make us incrementally ask resources, merge them to a bigger one, and launch our processes. The user scenario is described in the comments.
          Hide
          Wangda Tan (No longer used) added a comment -

          Got it, I'll let you know if I have any questions

          Hide
          Bikas Saha added a comment -

          You can make them sub-tasks of this jira. Use More->Create-sub-tasks from the menu items on top.

          Hide
          Wangda Tan (No longer used) added a comment -

          OK, I'll create a series of JIRAs for easier review/commit, thanks for your advice!

          Hide
          Sandy Ryza added a comment -

          +1 to what Bikas said. Would also be good to have a separate JIRA for the scheduler changes.

          Hide
          Bikas Saha added a comment -

          This is great. However, for efficient review/commit we will need to break these down into different jiras and attach patches to them. That will force us to look at each change by itself (using this jira for context) and more importantly, make sure each logically distinct piece is complete in itself. It should have its own tests, and pass Jenkins build/test independently so that it can be committed independently.
          Can you please do that and determine the order in which those jiras should be reviewed/committed? E.g. the API and protobuf changes should be a separate jira for the RM and NM protocols. That should probably be the first jira we review. And so on.

          Wangda Tan (No longer used) made changes -
          Attachment mapreduce-project.patch.ver.1 [ 12615527 ]
          Attachment tools-project.patch.ver.1 [ 12615528 ]
          Attachment yarn-api-protocol.patch.ver.1 [ 12615529 ]
          Attachment yarn-pb-impl.patch.ver.1 [ 12615530 ]
          Attachment yarn-server-common.patch.ver.1 [ 12615531 ]
          Attachment yarn-server-nodemanager.patch.ver.1 [ 12615532 ]
          Attachment yarn-server-resourcemanager.patch.ver.1 [ 12615533 ]
          Hide
          Wangda Tan (No longer used) added a comment -

          I just finished container resource increase support, including PB/API changes, making the capacity scheduler support increases, and making the NM support changing the monitored size of a running container.

          I split it into several patches for easier review:

          • API/pb file changes in hadoop-yarn-api
          • PB implementations in hadoop-yarn-common
          • yarn-server-common changes
          • yarn-server-resourcemanager changes, including the capacity scheduler, the AMS, etc.
          • yarn-server-nodemanager changes include ContainerManagerImpl and ContainersMonitor changes
          • other related project changes according to the updated APIs (map-reduce/tools)

          The above are preview patches and still very rough. Bikas Saha, Sandy Ryza, Alejandro Abdelnur, Vinod Kumar Vavilapalli, could you please review them? I'm eager for your ideas!

          And some short notes on the current RM/NM implementation not covered in the design doc:
          1) Implementation in the capacity scheduler for increasing a container's size
          It's very close to allocating a new container; some details:

          • An increase request is only valid when the asked size is larger than the existing resource and the container state is either RUNNING or ACQUIRED
          • The entry point for increase request allocation is still CapacityScheduler:nodeUpdate()
          • When an increase request cannot be allocated, it will also be reserved. Each node can reserve at most one request (an increase request or a new container request). I created a new method isReserved() in FiCaSchedulerNode so the scheduler/queue can identify whether a node is reserved
          • The main logic for increase request allocation is also placed in LeafQueue:assignContainers; increase requests are processed before new container requests.
          • Queue (leaf/parent) capacity and user capacity checks are also done before reserving or allocating an increase request
          • Queue (leaf/parent) used resource will also be deducted when an increase request is reserved
          • Users may submit increase requests several times for the same container with different sizes.
            • If the asked size is equal to the previously asked size, the request is ignored
            • If the asked size is smaller than or equal to the existing size, the increase request on this container is canceled
            • If the asked size differs from the previously asked size and is greater than the existing size, it replaces the previous ask and cancels the previous reservation (if one existed).

          2) Implementation in node manager for increasing a container size

          • It performs similar checks (token verification, etc.) to starting a container
          • The increase logic is only valid when the ContainerState (the internal ContainerState: org.apache.hadoop.yarn.server.nodemanager.containermanager.container.ContainerState) is RUNNING, to avoid a race condition.
          • ContainersMonitorImpl will put change requests into containersToBeChanged when it receives a CHANGE_MONITORING_CONTAINER event, and they will be processed in MonitoringThread:run(); a rough sketch of this flow follows.
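
          A rough sketch of that ContainersMonitor flow, with simplified stand-ins for the real ContainersMonitorImpl internals; the queue and limit map below are assumptions for illustration, not the actual implementation.

              import java.util.Map;
              import java.util.concurrent.ConcurrentHashMap;
              import java.util.concurrent.ConcurrentLinkedQueue;

              public class MonitorResizeSketch {
                  static class ChangeRequest {
                      final String containerId;
                      final long newMemLimitBytes;
                      ChangeRequest(String id, long limit) { containerId = id; newMemLimitBytes = limit; }
                  }

                  private final ConcurrentLinkedQueue<ChangeRequest> containersToBeChanged =
                      new ConcurrentLinkedQueue<>();
                  private final Map<String, Long> enforcedLimit = new ConcurrentHashMap<>();

                  // Called on a CHANGE_MONITORING_CONTAINER-style event from the dispatcher.
                  public void onChangeEvent(String containerId, long newLimitBytes) {
                      containersToBeChanged.add(new ChangeRequest(containerId, newLimitBytes));
                  }

                  // One iteration of the monitoring loop: apply pending changes, then check usage.
                  void monitorOnce(Map<String, Long> observedUsageBytes) {
                      ChangeRequest req;
                      while ((req = containersToBeChanged.poll()) != null) {
                          enforcedLimit.put(req.containerId, req.newMemLimitBytes);
                      }
                      for (Map.Entry<String, Long> e : observedUsageBytes.entrySet()) {
                          Long limit = enforcedLimit.get(e.getKey());
                          if (limit != null && e.getValue() > limit) {
                              // In the real monitor this is where an over-limit container would be flagged.
                              System.out.println("Container " + e.getKey() + " is over its (possibly resized) limit");
                          }
                      }
                  }
              }
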
          Hide
          Wangda Tan (No longer used) added a comment -

          Sandy Ryza
          This bunch of code will be hard to review, so I will make and upload an all-in-one draft patch for early preview before splitting it up. I hope I can finish the alpha patch this weekend :-]
          I agree that YARN enforces memory via monitoring in ContainersMonitor. I asked because I noticed that the design doc of YARN-3, https://issues.apache.org/jira/secure/attachment/12538426/mapreduce-4334-design-doc-v2.txt, mentions in "Design 2(b)" a "ResoureEnforcer.init()" before creating cgroups. I found this is not done yet, so for now we can safely update resource enforcement by updating the limits in ContainersMonitor.

          Sandy Ryza added a comment -

          +1 on the current design. If possible it would be best to split this up between a couple JIRAs - one for the increase and one for the decrease. We could do a third for the common proto definitions if necessary.

          I believe that YARN does memory enforcement in the same way whether or not the LCE is used. I.e. it monitors the container process and kills it if its memory consumption goes over the configured limit.
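          As a rough illustration of that monitor-and-kill style of enforcement, here is a minimal Java sketch; the usage probe and kill callback are placeholders, not the actual ContainersMonitor API.

          import java.util.Map;

          public class MemoryEnforcementSketch {
            interface UsageProbe { long memoryBytesOf(String containerId); }
            interface Killer { void kill(String containerId, String reason); }

            /** One monitoring pass: kill any container whose usage exceeds its configured limit. */
            static void enforce(Map<String, Long> limitsBytes, UsageProbe probe, Killer killer) {
              for (Map.Entry<String, Long> e : limitsBytes.entrySet()) {
                long used = probe.memoryBytesOf(e.getKey());
                if (used > e.getValue()) {
                  killer.kill(e.getKey(), "memory " + used + " bytes over limit " + e.getValue());
                }
              }
            }
          }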

          Wangda Tan (No longer used) made changes -
          Attachment yarn-1197-v4.pdf [ 12614261 ]
          Wangda Tan (No longer used) added a comment -

          I updated the design doc mainly based on Bikas Saha's latest comment.
          I left "Add implementation to NM to support changing resource isolation limitation of a container" blank in the design doc. From my understanding, LCE currently doesn't do any memory isolation (right?); it only sets up the CPU share when it sets up a container. I don't yet have the whole picture of these components. The solutions I can think of are:
          1) Because this is a soft limitation, we just don't do anything on the LCE side
          2) Create a new interface in ContainerExecutor that can update the resources of a container
          3) Disable increase/shrink when LCE is selected
          Hoping to get more of your opinions on this and the design doc.

          Wangda Tan (No longer used) added a comment -

          Bikas Saha
          Actually I was halfway through implementing these and got pulled away by other work. :-/

          On putting the increase request into ResourceRequest:
          I agree. I spent some time (the half-baked scheduler supporting increase) proving that putting the increase request into the resource request is NOT good, even though you pointed this out before. The original reason I put the increase request into ResourceRequest is that, literally speaking, the increase request is another form of "resource request": it also asks for more resource; the only difference is that an increase request adds a restriction on the request.
          But in YARN's real implementation it's problematic to make it part of the resource request; I would need to handle increase cases everywhere in the RM. I think making it a new member of AllocateRequest is the cleaner solution, but potentially it will cause more interface/implementation changes (SchedulerApplication, YARNScheduler, etc.). I'll keep looking at it before starting to write code.

          I also agree with your comments on improving the representation of ChangeContainerResourceResponse and the missing ResourceIncreaseContextProto in AllocateResponse. I'll add my design proposal for handling the new resource in the monitoring module.

          Again, your comments are really helpful; I hope to get more of your ideas.

          Bikas Saha added a comment -

          Can we do with just change_succeeded and change_failed lists instead of 4 lists? Using the containerId, the AM can determine whether each change was an increase or a decrease.

          +message ChangeContainersResourceResponseProto {
          +  repeated ContainerIdProto succeed_increased_containers = 1;
          +  repeated ContainerIdProto succeed_decreased_containers = 2;
          +  repeated ContainerIdProto failed_increased_containers = 3;
          +  repeated ContainerIdProto failed_decreased_containers = 4;
          +}
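          A rough Java-side sketch of the two-list variant suggested above (the type and method names are hypothetical, not the actual protocol records): the AM keeps the sizes it asked for, keyed by container id, and recovers increase vs. decrease by comparison.

          import java.util.List;
          import java.util.Map;

          public class TwoListResponseSketch {
            /** Hypothetical response carrying only succeeded and failed change lists. */
            record ChangeResult(List<String> succeededContainerIds, List<String> failedContainerIds) {}

            /** The AM can tell whether a succeeded change was an increase by comparing sizes. */
            static boolean wasIncrease(String containerId,
                                       Map<String, Long> requestedMb,
                                       Map<String, Long> previousMb) {
              return requestedMb.get(containerId) > previousMb.get(containerId);
            }
          }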
          

          I don't think it's correct for ResourceRequest to be used to increase resources for an allocated container. I was expecting a new optional repeated field of type ResourceChangeContextProto in AllocateRequest. To request an increase in container C's resources, the AM would add a ResourceChangeContextProto for that container in the next AllocateRequest.

          In AllocateResponse, the type of increased container should be ResourceIncreaseContextProto, right? Without that the AM cannot get the new container token for that container.

          The NM changes also need to handle enforcing the new resource via cgroups etc in addition to changing the monitoring. This needs to be clarified in the document.

          Bikas Saha added a comment -

          Wangda, sorry for the delayed response. Was caught up with other work. I will take a look at the new proposal. Vinod Kumar Vavilapalli Can you please take a look at the latest proposal?

          Wangda Tan (No longer used) made changes -
          Attachment yarn-1197-v3.pdf [ 12607745 ]
          Wangda Tan (No longer used) made changes -
          Assignee Wangda Tan [ gp.leftnoteasy ]
          Wangda Tan (No longer used) made changes -
          Attachment yarn-1197-v2.pdf [ 12606770 ]
          Wangda Tan (No longer used) added a comment -

          Guys, I attached an updated doc based on our discussion, mainly focused on the workflow diagram and detailed API changes.
          Many thanks to Bikas Saha, Sandy Ryza and Alejandro Abdelnur. Hope to get your feedback. I'll start working on it.

          Alejandro Abdelnur added a comment -

          Bikas, yep, there is a race condition in the AM->RM->NM path for decreasing. At least in the FS, due to continuous scheduling (YARN-1010), the RM could allocate the freed space to an AM before the NM heartbeats and gets the info. This does not happen if allocations are tied to the corresponding NM heartbeating. Thanks.

          Sandy Ryza added a comment -

          To summarize what I wrote above: YARN is already asymmetrical wrt acquiring and releasing resources. I don't think the minimum allocation logic is enough to justify a round trip to the RM. It will require adding more new states that will make the whole thing more confusing and bug-prone. We can either push down this logic into the NodeManager or just handle it on the RM side, i.e. refuse to free any resources in the scheduler for a container that decreases from 1024 to 1023 mb.

          Bikas Saha added a comment -

          Sandy had some arguments on why this has race conditions w.r.t. when the RM can start allocating the freed-up resources. Can you please look at the comments above to check whether it's the same thing or not.

          Alejandro Abdelnur added a comment -

          Bikas, makes sense, thanks for summarizing.

          On decreasing: given that we also do a round trip AM->NM->RM->AM, why not make it a bit more symmetric:

          * AM asks the RM to decrease a container
          * RM notifies the NM about the container decrease on the next heartbeat

          With this approach the RM can enforce the minimum on an AM decrease and reject it if it falls below the minimum. Also, there is no need to notify the AM of the decrease taking place, as the AM requested it. And since it is a decrease, the AM can instruct the container to shrink even if the RM has not told the NM yet. Furthermore, I would expect an AM to instruct a container to shrink before asking YARN, to avoid a race condition that could kill the container for using more resources than it should.

          Also, by doing this there would be no difference in the free-resource bookkeeping between the RM and the NMs, which may be handy so as not to complicate things for YARN-311.

          Thoughts?

          Bikas Saha added a comment -

          Alejandro, we have already iterated through the "create a new container and merge it with the old container" approach and saw no advantage over a cleaner API that asks for an increase of an existing container's resources. Internal scheduler implementations can choose to model this as a new allocation followed by an internal merge, instead of exposing a new container to the external environment only to have it merged back by the AM. That is more burden on the AM and NM, and confusing to tracking services like logs/AHS. Also, the delta container request would have to specify exact locality to place it where the existing container is, or else the RM can allocate the delta anywhere. This should just happen automatically if the request is to increase an existing container's capacity.

          Given the above, here is the current approach based on all the comments above.

          To increase container resources
          1) The AM asks the RM to increase resources
          2) The RM increases the resources and provides a new token signing that increase
          3) The AM calls a new NM API to increase the resources, signed with the above token

          To decrease container resources
          1) The AM asks the NM to decrease the resources
          2) The NM tells the RM about this
          3) The RM informs the AM about the change and allocates the freed space somewhere else

          The upside of this is that it avoids race conditions. The downside is that the flow is different in the two cases. Another downside is that the NM does not know whether the reduced size will be unsupported by the scheduler. IMO it's not a good idea for RM scheduler internals to leak into the NM, and the NM should not get involved in those kinds of decisions. One simple solution would be that when the NM tells the RM about reduced resources, the RM only reduces its internal bookkeeping down to min-allocation. The extra freed resources would sit unused on the node, but they would be unused anyway if min-allocation were being enforced by the NM, so we are not worse off. As far as the AM is concerned, it already promised to stay within the reduced capacity, so this shouldn't affect it.
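          To make the two flows above concrete, here is a hedged AM-side sketch in Java. The RmClient/NmClient interfaces are hypothetical stand-ins for the AM-RM and AM-NM protocols, not real YARN client APIs.

          public class ResizeFlowsSketch {
            interface RmClient {
              /** Ask the RM to grow a container; returns a signed token once granted, else null. */
              String requestIncrease(String containerId, long newMemoryMb);
            }
            interface NmClient {
              /** Apply an RM-granted increase on the NM, presenting the new token. */
              void increaseContainer(String containerId, long newMemoryMb, String token);
              /** Shrink immediately on the NM; the NM reports it to the RM on its next heartbeat. */
              void decreaseContainer(String containerId, long newMemoryMb);
            }

            static void increase(RmClient rm, NmClient nm, String id, long mb) {
              String token = rm.requestIncrease(id, mb);   // 1) AM asks the RM
              if (token != null) {                         // 2) RM grants and signs a new token
                nm.increaseContainer(id, mb, token);       // 3) AM presents the token to the NM
              }
            }

            static void decrease(NmClient nm, String id, long mb) {
              nm.decreaseContainer(id, mb);                // 1) AM tells the NM; 2)-3) NM->RM->AM follow via heartbeats
            }
          }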

          Sandy Ryza added a comment -

          Bikas Saha, yeah, that's my suggestion.

          Alejandro Abdelnur added a comment - - edited

          Wangda Tan (No longer used), thanks for your previous answer, it makes sense.

          We thought about this a while ago in the context of Llama for Impala-YARN integration.

          Along the lines of what Sandy suggested, just a couple of extra comments.

          For decreasing, the AM can request the correction, effective immediately, to the NM. The NM then reports the container correction and the new free space to the RM in the next heartbeat. Regarding enforcing the minimum: configuration properties are scheduler-specific, so the minimum would have to come to the NM from the RM as part of the registration response.

          For increasing, the AM must go to the RM first to avoid the race conditions already mentioned. To reduce the changes in the RM to a minimum, I was thinking of the following approach:

          • The AM makes a regular new allocation request for the desired delta capability increase with relaxedLocality=false (no changes to the AM-RM protocol/logic).
          • The AM waits for the delta container allocation from the RM.
          • When the AM receives the delta container allocation, it updates the original container with the delta container using a new AM-NM API.
          • The NM makes the necessary corrections locally to the original container, adding the capabilities of the delta container.
          • The NM notifies the RM to merge the original container with the delta container.
          • The RM updates the original container and drops the delta container.

          The complete list of changes for this approach would be:

          • AM-NM API
            • decreaseContainer(ContainerId original, Resources)
            • increaseContainer(ContainerId original, ContainerId delta)
          • NM-RM API
            • decreaseContainer(ContainerId original, Resources)
            • registration() -> +minimumcontainersize
            • mergeContainers(ContainerId originalKeep, ContainerId deltaDiscard)
          • NM logic
            • needs to correct capabilities enforcement for +/- delta
          • RM logic
            • needs to update container resources when receiving a NM's decreaseContainer() call
            • needs to update original container resources and delete delta container resources when receiving a NM's mergeContainer() call
          • RM scheduler API
            • it should expose methods for decreaseContainer() and mergeContainers() functionality
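          The API list above could look roughly like the following hypothetical Java interface sketch; the method and type names mirror the list but are not actual YARN classes, and Resources is flattened to memory/vcores for brevity.

          public interface ContainerMergeApisSketch {
            // AM -> NM
            void decreaseContainer(String originalContainerId, long memoryMb, int vcores);
            void increaseContainer(String originalContainerId, String deltaContainerId);

            // NM -> RM (reported on heartbeat)
            void reportDecrease(String originalContainerId, long memoryMb, int vcores);
            void mergeContainers(String originalKeepId, String deltaDiscardId);

            // RM -> NM at registration: the scheduler minimum, so the NM can reject
            // decreases below it locally.
            long minimumAllocationMb();
          }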
          Bikas Saha added a comment -

          So the suggestion is that increase goes AM(request) -> RM(allocation) -> AM(increase) -> NM, and decrease goes AM(decrease) -> NM(inform) -> RM(consider free) -> AM (confirmation from the RM, similar to completedContainerStatus)?

          Sandy Ryza added a comment -

          I dont think Sandy meant that the AM first tells the NM to decrease the size and then the NM informs the RM.

          You're right about what I meant. Though thinking about this more, is there any reason a container shrinking needs to get permission from the RM? Should we not treat giving up part of a container the same way we treat giving up an entire container, i.e. the app unilaterally decides when to do it? If we need to respect properties like yarn.scheduler.minimum-allocation-mb, the NodeManagers could pick these up and enforce them by rejecting shrinkings.
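          A minimal sketch of that NM-side rejection, assuming the scheduler minimum is handed to the NM (e.g. at registration); the class and method names are hypothetical.

          public final class ShrinkValidationSketch {
            private final long minimumAllocationMb;

            public ShrinkValidationSketch(long minimumAllocationMb) {
              this.minimumAllocationMb = minimumAllocationMb;
            }

            /** Accept a decrease only if it actually shrinks the container and stays at or above the minimum. */
            public boolean acceptDecrease(long currentMb, long targetMb) {
              return targetMb < currentMb && targetMb >= minimumAllocationMb;
            }
          }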

          The downside is that for the duration of the heartbeat interval, the node may get overbooked but that should not be a problem in practice since the container would already be using a lower value of resources before the AM asked its capacity to be decreased.

          Accepting overbooking in this context seems to me like it would open up a bunch of race conditions and compromise a bunch of useful assumptions an administrator can make about what's running on a node at a given time. Do the uses of container shrinking require such low latency? (which we would also achieve by avoiding the round trip to the RM)

          Wangda Tan (No longer used) added a comment -

          For decreasing resources, if the RM is to consider the free resource available only after the AM informs the NM and the NM heartbeats with the RM then this change may become more complicated since the current schedulers dont expect any lag in their allocations. This will also delay the allocation of the free space to others. Also this delay is determined by when the AM syncs with the NM. Thats not a good property. We should probably assume the decrease to be effective immediately and RM-NM sync should enforce that. The downside is that for the duration of the heartbeat interval, the node may get overbooked but that should not be a problem in practice since the container would already be using a lower value of resources before the AM asked its capacity to be decreased.

          I think it makes sense. Having the AM tell the NM first means the RM cannot leverage the freed resources, which is not good for a heavily loaded cluster. I'll update the document per our discussion and start breaking down tasks. Please let me know if you have any other comments.

          Bikas Saha added a comment -

          I don't think Sandy meant that the AM first tells the NM to decrease the size and then the NM informs the RM. He meant the AM asks the RM. The RM decreases/increases the size and then the AM informs the NM about the change; RM-NM communication happens via heartbeat, which may occur after some time.

          For decreasing resources, if the RM is to consider the free resource available only after the AM informs the NM and the NM heartbeats with the RM then this change may become more complicated since the current schedulers dont expect any lag in their allocations. This will also delay the allocation of the free space to others. Also this delay is determined by when the AM syncs with the NM. Thats not a good property. We should probably assume the decrease to be effective immediately and RM-NM sync should enforce that. The downside is that for the duration of the heartbeat interval, the node may get overbooked but that should not be a problem in practice since the container would already be using a lower value of resources before the AM asked its capacity to be decreased.
          The same problem does not hold for increasing resources.

          Wangda Tan (No longer used) added a comment -

          The method mentioned by Sandy Ryza can solve the "cheating" problem on the AM side. The AM doesn't need to tell the RM to decrease the container size at all, just the NM, and the NM tells the RM via heartbeat; the only issue is a latency on the order of seconds before the decrease shows up on the scheduler side, which shouldn't be a big problem.
          As for the race problem mentioned by Bikas Saha, I think it is acceptable: when an AM requests more resources for a container, it never knows when the RM will grant them. So the AM may need to launch the container with the smaller resources, which is not harmful to either the scheduler or the NM (using less resource is not a problem). After some time, the AM will get the allocated resources, and it can tell the NM and the child process to increase the memory quota. Do you agree with this?

          Sandy Ryza added a comment -

          It seems to me that for the reasons Bikas mentioned and for consistency with the way container launch is done, the AM should be the one who tells the NM to do the resize. If resources are released, then the NM would tell the RM about the newly free space on its next heartbeat after the resize has completed. Only then would the scheduler consider those resources available.

          Wangda Tan (No longer used) added a comment -

          To Bikas Saha,
          Thanks for your comments, see my opinions below,

          Still thinking through the RM-NM interactions. The request for change should probably be a new object that is basically a map of (containerId, Resource) where Resource is new value for the existing containerId. Not quite sure how we would use the new container token for a running container since its only used in start container.

          Agreed; we need to update the YarnScheduler.allocate interface to accept this as a parameter if we make the change request independent.
          And as you mentioned below, we can use the new token to update the NM's resource monitoring limits for containers.

          If we wait for RM to sync with NM about the increased resources then it might be too slow since this happens on a heartbeat and the heartbeat interval can be in the order of seconds. An alternative would be a new NM API to allow AM's to increase resources and this would be signed with new container token. But this would burden the AMs by requiring them to make that additional call.

          Agreed; this is much more time-effective than RM-NM communication. Yes, changing a container size has a cost for both the AM and the NM, but the AM should be disciplined enough not to do this too frequently.

          There could be a race between a new container token coming in with increased resources for an acquired container and the old container token being used by the NMClient to launch the container (in case the AM decides to launch the smaller container while it was waiting for an increase).

          Hmmm... thanks for the reminder; this really is a problem. Another issue I see is that the AM may "lie" to the RM/NM about resource usage. The AM can:
          1) allocate a big container and launch it
          2) ask to decrease the container, so the RM releases the resource for the corresponding node/application
          3) never tell the NM about this decrease, so the container can still use the resource that was released

          I don't have a good idea for solving this problem yet. Hope to get more ideas from you about this; I will think it through as well.

          Wangda Tan (No longer used) added a comment -

          To Alejandro Abdelnur,
          I think changing the heap size (Xmx, etc.) of a running JVM-based container is not really part of this topic. If a user wants to change a JVM-based container's size, he/she can use a "watcher" process that launches the "worker" process in the container and relaunches the "worker" with different JVM parameters if needed.
          In short, if we cannot solve this on the language side, we can solve it on the application side.

          Bikas Saha added a comment -

          Still thinking through the RM-NM interactions. The request for change should probably be a new object that is basically a map of (containerId, Resource) where Resource is new value for the existing containerId. Not quite sure how we would use the new container token for a running container since its only used in start container.
          If we wait for RM to sync with NM about the increased resources then it might be too slow since this happens on a heartbeat and the heartbeat interval can be in the order of seconds. An alternative would be a new NM API to allow AM's to increase resources and this would be signed with new container token. But this would burden the AMs by requiring them to make that additional call.
          There could be a race between a new container token coming in with increased resources for an acquired container and the old container token being used by the NMClient to launch the container (in case the AM decides to launch the smaller container while it was waiting for an increase).

          Alejandro Abdelnur added a comment -

          for my own education, how are you planning to reduce/increase the heap size of a running JVM-based container?

          Bikas Saha added a comment -

          Sorry for the delay. I will try to get to this soon.

          Wangda Tan (No longer used) made changes -
          Attachment yarn-1197.pdf [ 12604489 ]
          Wangda Tan (No longer used) added a comment -

          Added an initial proposal for this, including increasing/decreasing an acquired or running container; I hope somebody can help me review it. Then we can move forward to break down tasks and start working on it. Thanks.

          Wangda Tan (No longer used) added a comment -

          I totally agree with you. I'll work out a plan that considers increasing/decreasing an available container (allocated/running) with RM-AM-NM communication, and will keep you posted. Thanks.

          Bikas Saha made changes -
          Assignee Wangda Tan [ gp.leftnoteasy ]
          Bikas Saha made changes -
          Assignee Tan, Wangda [ wangda ] Wangda Tan [ gp.leftnoteasy ]
          Bikas Saha made changes -
          Assignee Tan, Wangda [ wangda ]
          Bikas Saha made changes -
          Summary Support increasing resources of an allocated container Support changing resources of an allocated container
          Bikas Saha added a comment -

          I am not quite sure what Exception you mention above. Reservation is the mechanism used by the scheduler to accumulate resources on a machine till a container request can be satisfied. If a different machine becomes free in the meanwhile then the container should be allocated to that machine and the reservation removed from the current machine.
          Hence, if we try to allocate more resources to an acquired container, then it will essentially go through the above cycle again. Thus we should have simply requested the larger container in the first place.
          In your use case, if you know you want a container of size X then you should ask for X. If you don't know exactly what size you want then you can ask for a larger container than needed. Later, while running the container, if you realize that the container can be smaller then you can reduce its size. If you realize that the container needs to be even larger then you can increase its size. It will be much easier to reduce the size of a running container than to increase it, since we are giving up resources to the RM and that is a straightforward operation. Increasing resources is harder because we have to wait for resources to free up. In both cases, we will need to improve RM-AM-NM communication to inform all parties that the container resources have changed.
          I am changing the title of this jira to dynamically change the resources of an allocated container since the same changes are needed to decrease and increase. Decrease is simpler than increase.

          Wangda Tan (No longer used) added a comment -

          Increasing resources for a container while in acquired state is not different from waiting for some more time on the RM and allocating the larger container in the first attempt, right?

          I think there's a small difference here: when waiting for resources for a big container in the first attempt, the scheduler puts the request into "reservedContainer" on FSSchedulableNode or FiCaSchedulerNode. This is treated as an exception; the RM will try to satisfy such a reserved container first when many different requests exist at the same time on the same node.
          But if we ask for more resources for an acquired container, I don't know your preference: do you want to create another "exception" that puts an "acquired container" into *SchedulableNode so it gets processed with priority, or simply treat the request as a normal resource request?

          Also, the RM starts a timer for each acquired container and expects the container to be launched on the NM before the timer expires. So we dont have too much time for the container to be launched and thus we cannot wait for increasing the resources.

          Could we refresh (receivePing) the timer for a container when we successfully increase its resources?

          To be useful, we have to be able to increase the resources of a running container. I agree that its a significant change. So making the change will need a more thorough investigation and clear design proposal.

          Agreed! I'd like to help move this forward. I need to investigate and consider end-to-end cases and draft a design proposal for it; once I have some ideas or questions, I will let you know.

          Thanks

          Bikas Saha added a comment -

          Increasing resources for a container while in acquired state is not different from waiting for some more time on the RM and allocating the larger container in the first attempt, right? Also, the RM starts a timer for each acquired container and expects the container to be launched on the NM before the timer expires. So we dont have too much time for the container to be launched and thus we cannot wait for increasing the resources.
          To be useful, we have to be able to increase the resources of a running container. I agree that its a significant change. So making the change will need a more thorough investigation and clear design proposal. Your help in making this happen is most welcome!

          Wangda Tan (No longer used) added a comment -

          Here is the tentative plan:
          To make it simple, we can start by only supporting container resource increases when the container state is "ACQUIRED", because when the container is running we need to find a way to inform the NM to update the quota of a running process, which will affect many parts.

          1. Add a field in ResourceRequest, like set/getExistingContainerId(), to indicate that the resource will be added to an existing container
          2. When the ask list is submitted to YarnScheduler, it will be normalized:
            • When existingContainerId is set
              • Check whether the container with "existingContainerId" is in state ACQUIRED; abandon the request if the container does not exist OR its state is not ACQUIRED.
              • Check whether this container's capability + the asked capability exceeds scheduler.maximum-allocation-mb; if so, normalize the asked capability
              • All fields other than "capability" are ignored, and the "priority/host" use the existing container's priority/host
              • Update resource requests in AppSchedulingInfo
          3. Add "increase existing container resource" support in LeafQueue of the capacity/fifo schedulers, and FSLeafQueue in the fair scheduler. We can treat an "increase existing container resource request" the same as other resource requests except:
            • Only "node-local" is available
            • Don't create a new container
          4. After we successfully allocate resources to an existing container, we need to refresh its container token and send it back to the AM. The AM can use the new container token to launch processes via the NM. (A rough sketch of steps 1-2 follows below.)
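          As a hedged illustration of steps 1-2 above, here is a minimal Java sketch of an ask optionally tagged with an existing container id, plus the normalization check. All names are illustrative; getExistingContainerId() is the proposed field, not an existing API, and capability is flattened to memory MB.

          public class IncreaseAskSketch {
            record Ask(String existingContainerId /* null for a normal ask */,
                       long capabilityMb, String host, int priority) {}

            /** Normalize an increase ask against the current container size and the scheduler maximum. */
            static Ask normalize(Ask ask, long currentMb, boolean acquired, long maxAllocationMb) {
              if (ask.existingContainerId() == null) {
                return ask;                                 // ordinary new-container request
              }
              if (!acquired) {
                return null;                                // abandon: container missing or not ACQUIRED
              }
              // cap the asked delta so current + delta stays within the scheduler maximum
              long capped = Math.min(ask.capabilityMb(), maxAllocationMb - currentMb);
              // priority/host come from the existing container; only capability is honored
              return new Ask(ask.existingContainerId(), capped, ask.host(), ask.priority());
            }
          }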

          Any thoughts? I'd like to help move this ahead if we can reach agreement on a rough idea.

          Bikas Saha made changes -
          Summary Add container merge support in YARN Support increasing resources of an allocated container
          Bikas Saha made changes -
          Link This issue is depended upon by YARN-896 [ YARN-896 ]
          Wangda Tan added a comment -

          Bikas,
          Thanks for your patient explanations. I just took a look at YARN-898, and it really helps.
          I agree the RM can do this much more easily than the AM. And as you pointed out, in our case increasing an existing container's size is the same as merging containers. I support changing this title to something like "increase an existing container size" and linking this to YARN-898.

          Bikas Saha added a comment -

          YARN allocates containers and manages them. I don't think it would be possible to change YARN to allow the app to get N containers, merge them, and then launch just 1 container.
          Again, the app could accumulate small containers on a node and then merge them or the RM could accumulate resources on a node and assign a container to the app. The end result is the same but IMO the chances of a converging solution are much higher if the RM does it and not the app. (because of point 2 in your comments about small requests)
          For YARN, increasing a running container size would be almost the same as assigning another container on the same node so that the new container may be merged with the running container. So a better feature request is to allow increase in container capacity for an allocated container.

          I would suggest that you look at the long-running services JIRA, YARN-896. Please check whether the items there cover your scenarios. If not, then please consider whether your feature request should actually be to allow increasing the resources of an allocated container. If you agree, then we can change the title of this JIRA to reflect that and link it to YARN-896.

          Wangda Tan added a comment -

          Hi Bikas,
          Thanks for the reply; it helps me understand the YARN mechanism, but I think there are some misunderstandings.

          In some HPC cases, how many processes will be launched on each node is not determined before we submit the job; we just give it enough total resource (like 100G) in the cluster. So we have the following problems:
          1) We will launch exactly one daemon process on each node, and this daemon process launches the other local processes. This is the root cause of why we want this feature.
          2) We don't know how much resource to request in this case:

          1. Large requests may cause some waste, and they are hard to get from the RM.
          2. Small requests may not be enough (when the cluster is busy, we cannot "regret": if we already have a small amount of space on a node, we can only return it and ask for a larger one, but once we have returned it, that space may be occupied by another app and we cannot take it back).

          With such an API, we could implement our AM more easily: we could iteratively send requests to the RM based on what we already have, and finally merge the allocations into big containers and hand them to the real app (like PBS/TORQUE/MPI). We could make a "small cluster" in YARN and support HPC workloads very well. (It's a little similar to Mesos, which aggregates resources to a slave daemon that then manages them, but we don't need to make it dynamic, i.e., increase a container's size while it is running; merging before we start processes would be good enough.)

          Bikas Saha added a comment -

          1) We can incrementally send resource request with small resources like before, until we get enough resources in total

          2) Merge resource in the same node, make only one big container in each node

          When the RM is asked for a container, this is exactly what the RM does: it incrementally adds reserved space on a node until it can allocate the full resources desired by the container, and then it assigns the container to the app. So it's not clear how making small allocations and then merging them in the app is going to help.
          By asking the RM directly for 10G resources we can ensure that the RM will eventually give us that. If we ask for 10 1G resources then we are not guaranteed that the RM will give them to us on the same node and thus the overall request may be unsatisfiable.
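
          (Illustrative aside, not part of the comment above: a minimal AM-side sketch of making the single large ask directly, using the standard AMRMClient API. Registration with the RM and the allocate heartbeat loop are omitted, and the class name is hypothetical.)

              import org.apache.hadoop.conf.Configuration;
              import org.apache.hadoop.yarn.api.records.Priority;
              import org.apache.hadoop.yarn.api.records.Resource;
              import org.apache.hadoop.yarn.client.api.AMRMClient;
              import org.apache.hadoop.yarn.client.api.AMRMClient.ContainerRequest;

              public class BigAskExample {
                public static void main(String[] args) {
                  // A single 10G ask lets the RM do the incremental reservation on
                  // one node and hand back a container only once the full amount
                  // is available there.
                  AMRMClient<ContainerRequest> amrmClient = AMRMClient.createAMRMClient();
                  amrmClient.init(new Configuration());
                  amrmClient.start();

                  ContainerRequest bigAsk = new ContainerRequest(
                      Resource.newInstance(10 * 1024, 1), // 10 GB, 1 vcore
                      null, null,                          // no node/rack constraint
                      Priority.newInstance(0));
                  amrmClient.addContainerRequest(bigAsk);

                  // By contrast, ten independent 1G asks may each be satisfied on a
                  // different node, so they cannot simply be combined afterwards.
                }
              }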

          Bikas Saha added a comment -

          (Copying description into comments to reduce email size.)

          Some applications (like OpenMPI) have their own daemon on each node (one per node) in their original implementation, and their users' processes are directly launched by the local daemon (like the task-tracker in MRv1, but per-application). Many functionalities depend on the pipes created when a process is forked by its parent, such as IO forwarding and process monitoring (which does more than what the NM does for us), and this may cause some scalability issues.

          A very common resource request in the MPI world is "give me 100G of memory in the cluster; I will launch 100 processes within this resource". In current YARN, we have the following two choices to make this happen:
          1) Send 1G allocation requests iteratively until we have 100G in total, then ask the NMs to launch the 100 MPI processes. That causes the problems mentioned above: no support for IO forwarding, process monitoring, etc.
          2) Send a larger resource request, like 10G. But we may encounter the following problems:
          2.1 Such a large resource request is hard to get at one time.
          2.2 We cannot use more resources on a node than the amount we specified (we can only launch one daemon per node).
          2.3 It is hard to decide how much resource to ask for.

          So my proposal is,
          1) We can incrementally send resource request with small resources like before, until we get enough resources in total
          2) Merge resource in the same node, make only one big container in each node
          3) Launch daemons in each node, and the daemon will spawn its local processes and manage them.

          For example,
          We need to run 10 processes, 1G for each, finally we got
          container 1, 2, 3, 4, 5 in node1.
          container 6, 7, 8 in node2.
          container 9, 10 in node3.
          Then we will,
          merge [1, 2, 3, 4, 5] to container_11 with 5G, launch a daemon, and the daemon will launch 5 processes
          merge [6, 7, 8] to container_12 with 3G, launch a daemon, and the daemon will launch 3 processes
          merge [9, 10] to container_13 with 2G, launch a daemon, and the daemon will launch 2 processes
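
          (Illustrative aside, not part of the comment above: YARN has no merge API today, but the per-node bookkeeping an AM would need before any merge, or before a size-increase request, is small. The sketch below groups allocated containers by node and totals their capabilities, matching the node1/node2/node3 example; the class and method names are hypothetical.)

              import java.util.HashMap;
              import java.util.List;
              import java.util.Map;

              import org.apache.hadoop.yarn.api.records.Container;
              import org.apache.hadoop.yarn.api.records.NodeId;
              import org.apache.hadoop.yarn.api.records.Resource;
              import org.apache.hadoop.yarn.util.resource.Resources;

              public class PerNodeGrouping {
                // Sum the capabilities of the allocated containers on each node,
                // e.g. {node1=5G, node2=3G, node3=2G} for the example above.
                public static Map<NodeId, Resource> totalPerNode(List<Container> allocated) {
                  Map<NodeId, Resource> perNode = new HashMap<>();
                  for (Container c : allocated) {
                    Resource soFar = perNode.get(c.getNodeId());
                    perNode.put(c.getNodeId(), soFar == null
                        ? Resources.clone(c.getResource())
                        : Resources.add(soFar, c.getResource()));
                  }
                  return perNode;
                }
              }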

          Bikas Saha made changes -
          Field: Description
          Original Value:
          Currently, YARN cannot support merge several containers in one node to a big container, which can make us incrementally ask resources, merge them to a bigger one, and launch our processes. The user scenario is,

          In some applications (like OpenMPI) has their own daemons in each node (one for each node) in their original implementation, and their user's processes are directly launched by its local daemon (like task-tracker in MRv1, but it's per-application). Many functionalities are depended on the pipes created when a process forked by its father, like IO-forwarding, process monitoring (it will do more logic than what NM did for us) and may cause some scalability issues.

          A very common resource request in MPI world is, "give me 100G memory in the cluster, I will launch 100 processes in this resource". In current YARN, we have following two choices to make this happen,
          1) Send allocation request with 1G memory iteratively, until we got 100G memories in total. Then ask NM launch such 100 MPI processes. That will cause some problems like cannot support IO-forwarding, processes monitoring, etc. as mentioned above.
          2) Send a larger resource request, like 10G. But we may encounter following problems,
             2.1 Such a large resource request is hard to get at one time.
             2.2 We cannot use other resources more than the number we specified in the node (we can only launch one daemon in one node).
             2.3 Hard to decide how much resource to ask.

          So my proposal is,
          1) We can incrementally send resource request with small resources like before, until we get enough resources in total
          2) Merge resource in the same node, make only one big container in each node
          3) Launch daemons in each node, and the daemon will spawn its local processes and manage them.

          For example,
          We need to run 10 processes, 1G for each, finally we got
          container 1, 2, 3, 4, 5 in node1.
          container 6, 7, 8 in node2.
          container 9, 10 in node3.
          Then we will,
          merge [1, 2, 3, 4, 5] to container_11 with 5G, launch a daemon, and the daemon will launch 5 processes
          merge [6, 7, 8] to container_12 with 3G, launch a daemon, and the daemon will launch 3 processes
          merge [9, 10] to container_13 with 2G, launch a daemon, and the daemon will launch 2 processes
          New Value:
          Currently, YARN cannot support merge several containers in one node to a big container, which can make us incrementally ask resources, merge them to a bigger one, and launch our processes. The user scenario is described in the comments.
          Wangda Tan added a comment -

          I don't know whether it is possible to add this on the RM or NM side.
          I think it would be easier to move some existing applications (OpenMPI, PBS, etc.) to the YARN platform, because such applications already have their own daemons in their old implementations, and container merge can help them leverage their original logic with fewer modifications to become residents of YARN.
          Your suggestions and comments are welcome!

          Thanks,
          Wangda

          Wangda Tan created issue -

            People

            • Assignee: Unassigned
            • Reporter: Wangda Tan
            • Votes: 12
            • Watchers: 84
