[YARN-2915] Enable YARN RM scale out via federation using multiple RM's - ASF JIRA

XML

Word

Printable

JSON

Details

Type: New Feature
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 2.9.0, 3.0.0-beta1
Component/s: nodemanager, resourcemanager
Labels:
- federation

Hadoop Flags:

Reviewed
Release Note:

Hide
A federation-based approach to transparently scale a single YARN cluster to tens of thousands of nodes, by federating multiple YARN standalone clusters (sub-clusters). The applications running in this federated environment will see a single massive YARN cluster and will be able to schedule tasks on any node of the federated cluster. Under the hood, the federation system will negotiate with sub-clusters ResourceManagers and provide resources to the application. The goal is to allow an individual job to “span” sub-clusters seamlessly.

Show
A federation-based approach to transparently scale a single YARN cluster to tens of thousands of nodes, by federating multiple YARN standalone clusters (sub-clusters). The applications running in this federated environment will see a single massive YARN cluster and will be able to schedule tasks on any node of the federated cluster. Under the hood, the federation system will negotiate with sub-clusters ResourceManagers and provide resources to the application. The goal is to allow an individual job to “span” sub-clusters seamlessly.

Description

This is an umbrella JIRA that proposes to scale out YARN to support large clusters comprising of tens of thousands of nodes. That is, rather than limiting a YARN managed cluster to about 4k in size, the proposal is to enable the YARN managed cluster to be elastically scalable.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

FEDERATION_CAPACITY_ALLOCATION_JIRA.pdf
08/Jul/15 00:05
751 kB
Carlo Curino
Federation-BoF.pdf
09/Jul/15 19:17
909 kB
Subramaniam Krishnan
federation-prototype.patch
08/Jul/15 00:46
729 kB
Subramaniam Krishnan
Yarn_federation_design_v1.pdf
16/May/15 01:41
787 kB
Subramaniam Krishnan
YARN-Federation-Hadoop-Summit_final.pptx
21/Jul/16 00:08
182 kB
Subramaniam Krishnan

Issue Links

blocks

YARN-6848 Move Router ClientRMServices Interceptor and chain into yarn api and common package

Open

depends upon

YARN-4879 Enhance Allocate Protocol to Identify Requests Explicitly

Resolved

is blocked by

YETUS-485 Yetus run is failing on branch after rebase/force push

Resolved

is depended upon by

REEF-337 Support REEF on YARN Federation

Open

YARN-5597 YARN Federation improvements

Resolved

is related to

YARN-5357 Timeline service v2 integration with Federation

Open

YARN-2884 Proxying all AM-RM communications

Resolved

HADOOP-12427 [JDK8] Upgrade Mockito version to 1.10.19

Resolved

relates to

HADOOP-13378 Common features between YARN and HDFS Router-based federation

Open

HDFS-10467 Router-based HDFS federation

Resolved

REEF-568 Work around the federated YARN node reports problem

Resolved

REEF-589 REEF crashes when new nodes are added to the clusters dynamically

Resolved

(3 is related to, 4 relates to)

Sub-Tasks

There are no Sub-Tasks for this issue.

Activity

People

Assignee:: Subramaniam Krishnan

Reporter:: Sriram Rao

Votes:: 2 Vote for this issue

Watchers:: 87 Start watching this issue

Dates

Created:: 02/Dec/14 17:07

Updated:: 05/Mar/19 01:44

Resolved:: 25/Sep/17 21:10