Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: None
    • Labels:
      None

      Description

      This umbrella JIRA tracks set of improvements over the YARN Federation MVP (YARN-2915)

        Attachments

          Issue Links

          1.
          Federation maintenance mechanisms (simple CLI and command propagation) Sub-task Open Carlo Curino
          2.
          Federation "Capacity Allocation" across sub-cluster Sub-task Open Carlo Curino
          3.
          Add versioning for FederationStateStore Sub-task Open Unassigned
          4.
          New ResourceCalculator implementation that operates on vector of resources, but respect sum of ratios Sub-task Open Unassigned
          5.
          Add support for AMRMProxy HA Sub-task Resolved Botong Huang
          6.
          Advanced Federation UI based on YARN UI v2 Sub-task Patch Available Tanuj Nayak
          7.
          Refactoring SQLFederationStateStore by avoiding to recreate the connections at every call Sub-task Open Unassigned
          8.
          Consider running RM tests against the Router Sub-task Open Unassigned
          9.
          Refactoring Router services to use common util classes for pipeline creations Sub-task Open Unassigned
          10.
          Create SecureLogin inside Router Sub-task Open Unassigned
          11.
          Replace MockResourceManagerFacade with MockRM for AMRMProxy/Router tests Sub-task Open Unassigned
          12.
          Handle containerId duplicate without failing the heartbeat in Federation Interceptor Sub-task Open Botong Huang
          13.
          Federation Router (hiding multiple RMs for ApplicationClientProtocol) phase 2 Sub-task Open Unassigned
          14.
          Metrics for Federation AMRMProxy Sub-task Open Young Chen
          15.
          Add support for work preserving NM restart when FederationInterceptor is enabled in AMRMProxyService Sub-task Resolved Botong Huang
          16.
          Adding RM ClusterId in AppInfo Sub-task Patch Available Tanuj Nayak
          17.
          Adding RM Cluster Id in ApplicationReport Sub-task Open Unassigned
          18.
          Federation Router Web Service fixes Sub-task Resolved Íñigo Goiri
          19.
          Race condition between second app attempt and UAM timeout when first attempt node is down Sub-task Open Botong Huang
          20.
          Handle AM register requests asynchronously in FederationInterceptor Sub-task Resolved Botong Huang
          21.
          Add config in FederationRMFailoverProxy to not bypass facade cache when failing over Sub-task Resolved Botong Huang
          22.
          AMRMProxy recover should catch for all throwable to avoid premature exit Sub-task Resolved Botong Huang
          23.
          Yarn RM Epoch should wrap around Sub-task Resolved Young Chen
          24.
          [AMRMProxy] Add sub-cluster timeout in LocalityMulticastAMRMProxyPolicy Sub-task Resolved Botong Huang
          25.
          [AMRMProxy] Metrics for AMRMClientRelayer inside FederationInterceptor Sub-task Resolved Young Chen
          26.
          [AMRMProxy] More robust responseId resync after an YarnRM master slave switch Sub-task Resolved Botong Huang
          27.
          [AMRMProxy] FederationInterceptor upgrade: home sub-cluster heartbeat async Sub-task Resolved Botong Huang
          28.
          LocalityMulticastAMRMProxyPolicy should fallback to random sub-cluster when cannot resolve resource Sub-task Resolved Botong Huang
          29.
          Add Yarnclient#yarnclusterMetrics API implementation in router Sub-task Resolved Bibin A Chundatt
          30.
          Refactor the UAM heartbeat thread in preparation for YARN-8696 Sub-task Resolved Botong Huang
          31.
          Add clean up for FederationStore apps Sub-task Resolved Unassigned
          32.
          [AMRMProxy] Fix concurrent re-register due to YarnRM failover in AMRMClientRelayer Sub-task Resolved Botong Huang
          33.
          Add audit logs for router service Sub-task Open Unassigned
          34.
          Create HomePolicyManager that sends all the requests to the home subcluster Sub-task Resolved Íñigo Goiri
          35.
          AMRMProxyPolicies should accept heartbeat response from new/unknown subclusters Sub-task Resolved Botong Huang
          36.
          [AMRMProxy] Fix thread leak in AMRMClientRelayer and UAM client Sub-task Patch Available Botong Huang
          37.
          Fix FederationInterceptor#allocate to set application priority in allocateResponse Sub-task Open Bilwa S T
          38.
          [Router] Federation: routing getContainers REST invocations transparently to multiple RMs Sub-task Patch Available Giovanni Matteo Fumarola
          39.
          [Router] Add JvmMetricsInfo and pause monitor Sub-task Open Bilwa S T

            Activity

              People

              • Assignee:
                subru Subru Krishnan
                Reporter:
                subru Subru Krishnan
              • Votes:
                0 Vote for this issue
                Watchers:
                17 Start watching this issue

                Dates

                • Created:
                  Updated: