Details
-
New Feature
-
Status: Closed
-
Major
-
Resolution: Fixed
-
3.0.0
-
None
Description
When running HS2 with LLAP and Workload Manager, HS2 becomes single point of failure as some of the states for workload management and scheduling are maintained in-memory.
The proposal is to support Active/Passive mode of high availability in which, all HS2 and tez AMs registers with ZooKeeper and a leader have to be chosen which will maintain stateful information. Clients using service discovery will always connect to the leader for submitting queries. There will also be some responsibilities for the leader, failover handling, tez session reconnect etc. Will upload some more detailed information in a separate doc.
Attachments
Attachments
Issue Links
- blocks
-
HIVE-18952 Tez session disconnect and reconnect on HS2 HA failover
- Resolved
- requires
-
TEZ-3892 getClient API for TezClient
- Closed
- links to