Details
-
New Feature
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
Description
YARN Federation today requires manual configuration of queues within each sub-cluster, and each RM operates "in isolation". This has few issues:
- Preemption is computed locally (and might far exceed the global need)
- Jobs within a queue are forced to consume their resources "evenly" based on queue mapping
This umbrella JIRA tracks a new feature that leverages the FederationStateStore as a synchronization mechanism among RMs, and allows for allocation and preemption decisions to be based on a (close to up-to-date) global view of the cluster allocation and demand. The JIRA also tracks algorithms to automatically generate policies for Router and AMRMProxy to shape the traffic to each sub-cluster, and general "maintenance" of the FederationStateStore.
Attachments
Issue Links
- is related to
-
YARN-5597 YARN Federation improvements
- Resolved
-
YARN-11105 Prohibit jobs from running across DCs by YARN Federation
- Open
- links to