Description
As of Knox 0.5.0, support for WebHDFS HA is available through configuration of a HA provider and multiple URLs specified for the WEBHDFS service. There is a desire to provide similar support in Knox for HiveServer2 HA.
HiveServer2 HA has a different implementation than WebHDFS HA in that the server information is meant to be looked up via Zookeeper. The configuration information involved is typically a Zookeeper ensemble and a Zookeeper namespace to lookup the registered HiveServer2 information. The JDBC client (Hive) today uses this looked up information to randomly pick a server to then send the actual request to. In a failover scenario, the client also keeps track of the server that has failed and picks another server at random from the list of servers retrieved from Zookeeper. Logic similar to this can be implemented in Knox in a specialized Dispatch class so that the client connecting to Knox does not have to deal with HA gymnastics.
Attachments
Issue Links
- is related to
-
KNOX-567 Having Knox Supporting Hadoop/Yarn/HBase/Hive High Availability
- Closed