ZooKeeper / ZOOKEEPER-485

Need ops documentation that details supervision of ZK server processes

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 3.3.0
    • Component/s: documentation, server
    • Labels:
      None
    • Hadoop Flags:
      Reviewed

      Description

      We need ops documentation detailing what to do if the ZK server VM fails. By "fails" I mean the JVM process
      exits, dies, crashes, etc.

      In general, a supervisor process should be used to start, stop, and restart the ZK server JVM.

      Something like daemontools (http://cr.yp.to/daemontools.html) could be used, or, more simply, a wrapper script
      could monitor the status of the pid and restart the JVM if it fails. The choice is up to the operator. If restarts
      are not automated, an operator will have to restart the ZK server JVM manually.
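
The wrapper-script idea above might look roughly like the following. This is a minimal sketch, not what the committed patch contains: the function name `supervise`, the restart limit, and the delay are all hypothetical, and it assumes the server command stays in the foreground so the wrapper can wait on it.

```python
import subprocess
import sys
import time


def supervise(cmd, max_restarts=5, restart_delay=1.0):
    """Run cmd in the foreground and restart it each time it exits.

    Returns the list of exit codes observed. Gives up after
    max_restarts restarts so a crash-looping server does not spin
    forever (the limit is an illustrative choice, not a ZK setting).
    """
    exit_codes = []
    restarts = 0
    while True:
        proc = subprocess.Popen(cmd)
        code = proc.wait()  # blocks until the child exits or crashes
        exit_codes.append(code)
        print(f"process {proc.pid} exited with code {code}", file=sys.stderr)
        if restarts >= max_restarts:
            return exit_codes
        restarts += 1
        time.sleep(restart_delay)  # brief pause before restarting
```

The `cmd` argument would be whatever command line the operator uses to launch the ZK server JVM in the foreground. A dedicated supervisor such as daemontools covers the same ground and also handles logging and supervision of the wrapper itself.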

      The inherent behavior of ZK with respect to failures, i.e. that it automatically recovers as long as quorum is
      maintained, fits into this nicely.
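
To make "as long as quorum is maintained" concrete: with majority quorums, an ensemble keeps serving while more than half of its servers are up. The small sketch below (the helper name `tolerated_failures` is hypothetical) shows how many servers can be down at once, which is the window a supervisor has to restart a failed JVM before availability is affected.

```python
def tolerated_failures(ensemble_size):
    """Servers that can be down while a strict majority remains up."""
    quorum = ensemble_size // 2 + 1  # smallest strict majority
    return ensemble_size - quorum


for n in (3, 5, 7):
    print(n, tolerated_failures(n))
# prints:
# 3 1
# 5 2
# 7 3
```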

          Activity

          Brett Eisenberg added a comment -

          FWIW, Zookeeper works great under SMF (http://en.wikipedia.org/wiki/Service_Management_Facility)

          Patrick Hunt added a comment -

          Brett – nice, we'll try to include that in the writeup. Do you have anything we could use as an example of how to run ZK under SMF?

          Patrick Hunt added a comment -

          this patch details having a supervisory process (also fills out the monitoring section)

          Hadoop QA added a comment -

          +1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12436408/ZOOKEEPER-485.patch
          against trunk revision 912052.

          +1 @author. The patch does not contain any @author tags.

          +0 tests included. The patch appears to be a documentation patch that doesn't require tests.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 findbugs. The patch does not introduce any new Findbugs warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          +1 core tests. The patch passed core unit tests.

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: http://hudson.zones.apache.org/hudson/job/Zookeeper-Patch-h7.grid.sp2.yahoo.net/74/testReport/
          Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Zookeeper-Patch-h7.grid.sp2.yahoo.net/74/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
          Console output: http://hudson.zones.apache.org/hudson/job/Zookeeper-Patch-h7.grid.sp2.yahoo.net/74/console

          This message is automatically generated.

          Mahadev konar added a comment -

          +1 the patch looks good.

          I just committed this. thanks pat.

          Hudson added a comment -

          Integrated in ZooKeeper-trunk #706 (see http://hudson.zones.apache.org/hudson/job/ZooKeeper-trunk/706/).
          Need ops documentation that details supervision of ZK server processes. (phunt via mahadev)


            People

            • Assignee:
              Patrick Hunt
              Reporter:
              Patrick Hunt
            • Votes:
              0
              Watchers:
              1
