ZooKeeper / ZOOKEEPER-485

need ops documentation that details supervision of ZK server processes

Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 3.3.0
    • Component/s: documentation, server
    • Labels: None
    • Hadoop Flags: Reviewed

    Description

      We need ops documentation detailing what to do if the ZK server VM fails. By "fail" I mean that the JVM
      process exits, dies, crashes, etc.

      In general, a supervisor process should be used to start, stop, and restart the ZK server JVM.

      Something like daemontools (http://cr.yp.to/daemontools.html) could be used, or, more simply, a wrapper
      script could monitor the status of the pid and restart the JVM if it fails. It's up to the operator: if this
      is not done automatically, it will have to be done manually, by the operator restarting the ZK server JVM.
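
      A wrapper of the kind described above could look roughly like the following. This is only a minimal
      sketch, not the content of the attached patch: the function name `supervise` and its parameters
      (`max_starts`, `restart_delay_s`) are hypothetical, and it assumes the server command runs in the
      foreground so that the wrapper can wait on the child process.

```python
import subprocess
import sys
import time


def supervise(cmd, max_starts=None, restart_delay_s=1.0):
    """Run cmd and start it again every time it exits.

    cmd             -- argument list for the server process (assumed to run in the foreground)
    max_starts      -- hypothetical cap on the number of starts; None means keep restarting forever
    restart_delay_s -- pause between an exit and the next start, to avoid a tight crash loop

    Returns the list of exit codes observed (only reachable when max_starts is set).
    """
    exit_codes = []
    while max_starts is None or len(exit_codes) < max_starts:
        proc = subprocess.Popen(cmd)
        exit_code = proc.wait()  # blocks until the child process exits for any reason
        exit_codes.append(exit_code)
        print(f"child pid {proc.pid} exited with code {exit_code}", file=sys.stderr)
        if max_starts is None or len(exit_codes) < max_starts:
            time.sleep(restart_delay_s)
    return exit_codes
```

      For example, `supervise(["bin/zkServer.sh", "start-foreground"])` would keep a server running, assuming
      the installed ZooKeeper version ships a start script with a foreground mode; otherwise substitute
      whatever command starts the server JVM in the foreground. A dedicated supervisor such as daemontools
      also handles logging and signal forwarding, which this sketch does not.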

      The inherent behavior of ZK with respect to failures (i.e., that it automatically recovers as long as
      quorum is maintained) fits into this nicely.
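
      To make "as long as quorum is maintained" concrete, the arithmetic can be sketched as below. This
      assumes the default majority quorum (more than half of the ensemble must be up); the helper names are
      hypothetical and are not part of ZooKeeper.

```python
def quorum_size(ensemble_size):
    """Smallest number of servers that forms a strict majority of the ensemble."""
    return ensemble_size // 2 + 1


def tolerated_failures(ensemble_size):
    """Number of servers that can be down while a majority is still running."""
    return ensemble_size - quorum_size(ensemble_size)


for n in (3, 4, 5):
    print(n, quorum_size(n), tolerated_failures(n))
```

      Under that assumption, a 3-server ensemble survives one failed server and a 5-server ensemble
      survives two, so a supervisor that restarts a crashed JVM promptly restores the failure margin
      before a second crash can cost the quorum.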

      Attachments

        1. ZOOKEEPER-485.patch
          2 kB
          Patrick D. Hunt

            People

              Assignee: Patrick D. Hunt (phunt)
              Reporter: Patrick D. Hunt (phunt)