Uploaded image for project: 'ZooKeeper'
  1. ZooKeeper
  2. ZOOKEEPER-4783

leader crash because of zxid 32b rollover but no other server takes the lead

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 3.8.3
    • None
    • None
    • None
    • Linux amd64 Ubuntu 20.04.5

      Java OpenJDK17U-jre_x64_linux_hotspot_17.0.8.1_1.tar.gz

    Description

      Got a 5 node cluster running on baremetal servers (with NVMe) used by a ClickHouse cluster on a separate cluster.

      This morning, a crash on the leader did let my clusters unusable as while the leader crashed, none of the 4 followers did take the lead

       

      zookeeper leader was zookeeper08

      05/06/07/09 were the followers

       

      Only a restart of zookeeper05 process did unfreeze the whole cluster

      Attachments

        1. zoo.cfg
          1 kB
          Stéphane Loeuillet
        2. zookeeper_crash.log
          22 kB
          Stéphane Loeuillet

        Activity

          People

            Unassigned Unassigned
            sloeuillet Stéphane Loeuillet
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated: