Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-31780

Allow users to disable "Ensemble tracking" for ZooKeeper

    XMLWordPrintableJSON

Details

    Description

      In Apache Curator an option to skip ensemble tracking was added since version 5.0.0 (CURATOR-568)

      This can be useful in certain scenarios in which CuratorFramework is accessing to ZK clusters via load balancer or Virtual IPs.
      Thus in case Zookeeper of Flink user is running behind LB or Virtual IP ensemble tracking could be disabled too.

      In case ZooKeeper is hidden under VIP it can return URL during Ensemble Tracking, which would lead to Unresolved Host Exception inside Curator Framework. On Flink level it would lead to cluster restart.

      Currently HA with ZooKeeper can even lead to the JobManager failure. The scenario of the failure is next:

      1. Flink connects to ZooKeeper via configured URL.
      2. Ensemble tracking gets a new URL of ensemble, which is not obligatory accessible for Flink, because Zookeeper is under VIP.
      3. In case of reconnect Flink fails to Zookeeper, moreover due to "UnresolvedHostException" Flink's jobManager is killed.

      Acceptance Criteria: Users of Apache Flink has a Zookeeper config option to disable ensemble tracking for ZooKeeper.

      Attachments

        Issue Links

          Activity

            People

              Oleksandr Nitavskyi Oleksandr Nitavskyi
              Oleksandr Nitavskyi Oleksandr Nitavskyi
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: