|
|
|
YARN-4243
|
YARN-149
Add retry on establishing Zookeeper conenction in EmbeddedElectorService#serviceInit
|
Xuan Gong
|
Xuan Gong
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
YARN-4107
|
YARN-149
Both RM becomes Active if all zookeepers can not connect to active RM
|
Xuan Gong
|
Xuan Gong
|
|
Resolved |
Won't Fix
|
|
|
|
|
|
|
|
YARN-4101
|
YARN-149
RM should print alert messages if Zookeeper and Resourcemanager gets connection issue
|
Xuan Gong
|
Yesha Vora
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-4092
|
YARN-149
RM HA UI redirection needs to be fixed when both RMs are in standby mode
|
Xuan Gong
|
Xuan Gong
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-3893
|
YARN-149
Both RM in active state when Admin#transitionToActive failure from refeshAll()
|
Bibin Chundatt
|
Bibin Chundatt
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-3711
|
YARN-149
Documentation of ResourceManager HA should explain configurations about listen addresses
|
Masatake Iwasaki
|
Masatake Iwasaki
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-3705
|
YARN-149
forcemanual transitionToStandby in RM-HA automatic-failover mode should change elector state
|
Masatake Iwasaki
|
Masatake Iwasaki
|
|
Open |
Unresolved
|
|
|
|
|
|
|
|
YARN-3006
|
YARN-149
Improve the error message when attempting manual failover with auto-failover enabled
|
Akira Ajisaka
|
Akira Ajisaka
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-2807
|
YARN-149
Option "--forceactive" not works as described in usage of "yarn rmadmin -transitionToActive"
|
Masatake Iwasaki
|
Wangda Tan
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-2605
|
YARN-149
[RM HA] Rest api endpoints doing redirect incorrectly
|
Xuan Gong
|
bc Wong
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-2259
|
YARN-149
NM-Local dir cleanup failing when Resourcemanager switches
|
Unassigned
|
Nishan Shetty
|
|
Open |
Unresolved
|
|
|
|
|
|
|
|
YARN-2258
|
YARN-149
Aggregation of MR job logs failing when Resourcemanager switches
|
Wangda Tan
|
Nishan Shetty
|
|
Resolved |
Duplicate
|
|
|
|
|
|
|
|
YARN-1898
|
YARN-149
Standby RM's conf, stacks, logLevel, metrics, jmx and logs links are redirecting to Active RM
|
Xuan Gong
|
Yesha Vora
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1895
|
YARN-149
Add testcases to test AMRMToken on HA
|
Xuan Gong
|
Xuan Gong
|
|
Resolved |
Duplicate
|
|
|
|
|
|
|
|
YARN-1893
|
YARN-149
Make ApplicationMasterProtocol#allocate AtMostOnce
|
Xuan Gong
|
Xuan Gong
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1882
|
YARN-149
Implement and verify Scheduler#moveApplication() idempotent for CapacityScheduler/FairScheduler
|
Xuan Gong
|
Xuan Gong
|
|
Open |
Unresolved
|
|
|
|
|
|
|
|
YARN-1878
|
YARN-149
Yarn standby RM taking long to transition to active
|
Xuan Gong
|
Arpit Gupta
|
|
Open |
Unresolved
|
|
|
|
|
|
|
|
YARN-1877
|
YARN-149
Document yarn.resourcemanager.zk-auth and its scope
|
Robert Kanter
|
Karthik Kambatla
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1861
|
YARN-149
Both RM stuck in standby mode when automatic failover is enabled
|
Karthik Kambatla
|
Arpit Gupta
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1860
|
YARN-149
cancelDelegationToken should survive RM failover
|
Zhijie Shen
|
Zhijie Shen
|
|
Open |
Unresolved
|
|
|
|
|
|
|
|
YARN-1848
|
YARN-149
Persist ClusterMetrics across RM HA transitions
|
Unassigned
|
Karthik Kambatla
|
|
Open |
Unresolved
|
|
|
|
|
|
|
|
YARN-1836
|
YARN-149
Add retry cache support in ResourceManager
|
Tsuyoshi Ozawa
|
Tsuyoshi Ozawa
|
|
Resolved |
Invalid
|
|
|
|
|
|
|
|
YARN-1811
|
YARN-149
RM HA: AM link broken if the AM is on nodes other than RM
|
Robert Kanter
|
Robert Kanter
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1779
|
YARN-149
Handle AMRMTokens across RM failover
|
Jian He
|
Karthik Kambatla
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1776
|
YARN-149
renewDelegationToken should survive RM failover
|
Zhijie Shen
|
Zhijie Shen
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1766
|
YARN-149
When RM does the initiation, it should use loaded Configuration instead of bootstrap configuration.
|
Xuan Gong
|
Xuan Gong
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1765
|
YARN-149
Write test cases to verify that killApplication API works in RM HA
|
Xuan Gong
|
Xuan Gong
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1764
|
YARN-149
Handle RM fail overs after the submitApplication call.
|
Xuan Gong
|
Xuan Gong
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1763
|
YARN-149
Handle RM failovers during the submitApplication call.
|
Xuan Gong
|
Xuan Gong
|
|
Resolved |
Duplicate
|
|
|
|
|
|
|
|
YARN-1761
|
YARN-149
RMAdminCLI should check whether HA is enabled before executes transitionToActive/transitionToStandby
|
Xuan Gong
|
Xuan Gong
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1734
|
YARN-149
RM should get the updated Configurations when it transits from Standby to Active
|
Xuan Gong
|
Xuan Gong
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1705
|
YARN-149
Reset cluster-metrics on transition to standby
|
Rohith Sharma K S
|
Karthik Kambatla
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1696
|
YARN-149
Document RM HA
|
Tsuyoshi Ozawa
|
Karthik Kambatla
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1693
|
YARN-149
Cleanup YARN HAUtil class
|
Vinod Kumar Vavilapalli
|
Vinod Kumar Vavilapalli
|
|
Open |
Unresolved
|
|
|
|
|
|
|
|
YARN-1679
|
YARN-149
Make admin refresh of Fair scheduler configuration work across RM failover
|
Xuan Gong
|
Xuan Gong
|
|
Open |
Unresolved
|
|
|
|
|
|
|
|
YARN-1676
|
YARN-149
Make admin refreshUserToGroupsMappings of configuration work across RM failover
|
Xuan Gong
|
Xuan Gong
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1669
|
YARN-149
Make admin refreshServiceAcls work across RM failover
|
Xuan Gong
|
Xuan Gong
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1668
|
YARN-149
Make admin refreshAdminAcls work across RM failover
|
Xuan Gong
|
Xuan Gong
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1667
|
YARN-149
Make admin refreshSuperUserGroupsConfiguration work across RM failover
|
Xuan Gong
|
Xuan Gong
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1666
|
YARN-149
Make admin refreshNodes work across RM failover
|
Xuan Gong
|
Xuan Gong
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1665
|
YARN-149
Set better defaults for HA configs for automatic failover
|
Xuan Gong
|
Arpit Gupta
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1660
|
YARN-149
add the ability to set yarn.resourcemanager.hostname.rm-id instead of setting all the various host:port properties for RM
|
Xuan Gong
|
Arpit Gupta
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1658
|
YARN-149
Webservice should redirect to active RM when HA is enabled.
|
Cindy Li
|
Cindy Li
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1642
|
YARN-149
RMDTRenewer#getRMClient should use ClientRMProxy
|
Karthik Kambatla
|
Karthik Kambatla
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1641
|
YARN-149
ZK store should attempt a write periodically to ensure it is still Active
|
Karthik Kambatla
|
Karthik Kambatla
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1640
|
YARN-149
Manual Failover does not work in secure clusters
|
Xuan Gong
|
Xuan Gong
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1639
|
YARN-149
YARM RM HA requires different configs on different RM hosts
|
Xuan Gong
|
Arpit Gupta
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1611
|
YARN-149
Make admin refresh of capacity scheduler configuration work across RM failover
|
Xuan Gong
|
Xuan Gong
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1598
|
YARN-149
HA-related rmadmin commands don't work on a secure cluster
|
Karthik Kambatla
|
Karthik Kambatla
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1584
|
YARN-149
Support explicit failover when automatic failover is enabled
|
Karthik Kambatla
|
Karthik Kambatla
|
|
Resolved |
Won't Fix
|
|
|
|
|
|
|
|
YARN-1579
|
YARN-149
ActiveRMInfoProto fields should be optional
|
Karthik Kambatla
|
Karthik Kambatla
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1574
|
YARN-149
RMDispatcher should be reset on transition to standby
|
Xuan Gong
|
Xuan Gong
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1573
|
YARN-149
ZK store should use a private password for root-node-acls
|
Karthik Kambatla
|
Karthik Kambatla
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1559
|
YARN-149
Race between ServerRMProxy and ClientRMProxy setting RMProxy#INSTANCE
|
Karthik Kambatla
|
Karthik Kambatla
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1540
|
YARN-149
Add an easy way to turn on HA
|
Karthik Kambatla
|
Karthik Kambatla
|
|
Resolved |
Invalid
|
|
|
|
|
|
|
|
YARN-1535
|
YARN-149
Add an option to yarn rmadmin to clear the znode used by embedded elector
|
Unassigned
|
Karthik Kambatla
|
|
Open |
Unresolved
|
|
|
|
|
|
|
|
YARN-1525
|
YARN-149
Web UI should redirect to active RM when HA is enabled.
|
Cindy Li
|
Xuan Gong
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1523
|
YARN-149
Use StandbyException instead of RMNotYetReadyException
|
Karthik Kambatla
|
Bikas Saha
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1521
|
YARN-149
Mark appropriate protocol methods with the idempotent annotation or AtMostOnce annotation
|
Xuan Gong
|
Xuan Gong
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1514
|
YARN-149
Utility to benchmark ZKRMStateStore#loadState for ResourceManager-HA
|
Tsuyoshi Ozawa
|
Tsuyoshi Ozawa
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1485
|
YARN-149
Enabling HA should verify the RM service addresses configurations have been set for every RM Ids defined in RM_HA_IDs
|
Xuan Gong
|
Xuan Gong
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1482
|
YARN-149
WebApplicationProxy should be always-on w.r.t HA even if it is embedded in the RM
|
Xuan Gong
|
Vinod Kumar Vavilapalli
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1481
|
YARN-149
Move internal services logic from AdminService to ResourceManager
|
Vinod Kumar Vavilapalli
|
Vinod Kumar Vavilapalli
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1459
|
YARN-149
RM services should depend on ConfigurationProvider during startup too
|
Xuan Gong
|
Karthik Kambatla
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1411
|
YARN-149
HA config shouldn't affect NodeManager RPC addresses
|
Karthik Kambatla
|
Karthik Kambatla
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1410
|
YARN-149
Handle RM fails over after getApplicationID() and before submitApplication().
|
Xuan Gong
|
Bikas Saha
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1326
|
YARN-149
RM should log using RMStore at startup time
|
Tsuyoshi Ozawa
|
Tsuyoshi Ozawa
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1325
|
YARN-149
Enabling HA should check Configuration contains multiple RMs
|
Xuan Gong
|
Tsuyoshi Ozawa
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1323
|
YARN-149
Set HTTPS webapp address along with other RPC addresses in HAUtil
|
Karthik Kambatla
|
Karthik Kambatla
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1318
|
YARN-149
Promote AdminService to an Always-On service and merge in RMHAProtocolService
|
Karthik Kambatla
|
Karthik Kambatla
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1305
|
YARN-149
RMHAProtocolService#serviceInit should handle HAUtil's IllegalArgumentException
|
Tsuyoshi Ozawa
|
Tsuyoshi Ozawa
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1232
|
YARN-149
Configuration to support multiple RMs
|
Karthik Kambatla
|
Karthik Kambatla
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1222
|
YARN-149
Make improvements in ZKRMStateStore for fencing
|
Karthik Kambatla
|
Bikas Saha
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1202
|
YARN-149
Verify RM HA works in secure clusters
|
Unassigned
|
Karthik Kambatla
|
|
Resolved |
Won't Fix
|
|
|
|
|
|
|
|
YARN-1193
|
YARN-149
ResourceManger.clusterTimeStamp should be reset when RM transitions to active
|
Unassigned
|
Karthik Kambatla
|
|
Resolved |
Duplicate
|
|
|
|
|
|
|
|
YARN-1192
|
YARN-149
Update HAServiceState to STOPPING on RM#stop()
|
Karthik Kambatla
|
Karthik Kambatla
|
|
Resolved |
Duplicate
|
|
|
|
|
|
|
|
YARN-1181
|
YARN-149
Augment MiniYARNCluster to support HA mode
|
Karthik Kambatla
|
Karthik Kambatla
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1177
|
YARN-149
Support automatic failover using ZKFC
|
Unassigned
|
Karthik Kambatla
|
|
Open |
Unresolved
|
|
|
|
|
|
|
|
YARN-1165
|
YARN-149
Move init() of activeServices to ResourceManager#serviceStart()
|
Karthik Kambatla
|
Karthik Kambatla
|
|
Resolved |
Not A Problem
|
|
|
|
|
|
|
|
YARN-1147
|
YARN-149
Add end-to-end tests for HA
|
Xuan Gong
|
Karthik Kambatla
|
|
Open |
Unresolved
|
|
|
|
|
|
|
|
YARN-1125
|
YARN-149
Add shutdown support to non-service RM components
|
Xuan Gong
|
Karthik Kambatla
|
|
Open |
Unresolved
|
|
|
|
|
|
|
|
YARN-1099
|
YARN-149
Revisit exception handling in ZKRMStateStore post RM HA
|
Unassigned
|
Karthik Kambatla
|
|
Resolved |
Not A Problem
|
|
|
|
|
|
|
|
YARN-1098
|
YARN-149
Separate out RM services into "Always On" and "Active"
|
Karthik Kambatla
|
Karthik Kambatla
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1068
|
YARN-149
Add admin support for HA operations
|
Karthik Kambatla
|
Karthik Kambatla
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1033
|
YARN-149
Expose RM active/standby state to Web UI and REST API
|
Karthik Kambatla
|
Nemon Lou
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1029
|
YARN-149
Allow embedding leader election into the RM
|
Karthik Kambatla
|
Bikas Saha
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1028
|
YARN-149
Add FailoverProxyProvider like capability to RMProxy
|
Karthik Kambatla
|
Bikas Saha
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1027
|
YARN-149
Implement RMHAProtocolService
|
Karthik Kambatla
|
Bikas Saha
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
YARN-1026
|
YARN-149
Test and verify ACL based ZKRMStateStore fencing for RM State Store
|
Karthik Kambatla
|
Bikas Saha
|
|
Resolved |
Duplicate
|
|
|
|
|
|
|
|
YARN-986
|
YARN-149
RM DT token service should have service addresses of both RMs
|
Karthik Kambatla
|
Vinod Kumar Vavilapalli
|
|
Closed |
Fixed
|
|
|
|
|