Uploaded image for project: 'ZooKeeper'
  1. ZooKeeper
  2. ZOOKEEPER-3036

Unexpected exception in zookeeper

Add voteVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Critical
    • Resolution: Unresolved
    • 3.4.10
    • None
    • quorum, server
    • None
    • 3 Zookeepers, 5 kafka servers

    Description

      We got an issue with one of the zookeeprs (Leader), causing the entire kafka cluster to fail:

      2018-05-09 02:29:01,730 [myid:3] - ERROR [LearnerHandler-/192.168.0.91:42490:LearnerHandler@648] - Unexpected exception causing shutdown while sock still open
      java.net.SocketTimeoutException: Read timed out
              at java.net.SocketInputStream.socketRead0(Native Method)
              at java.net.SocketInputStream.socketRead(SocketInputStream.java:116)
              at java.net.SocketInputStream.read(SocketInputStream.java:171)
              at java.net.SocketInputStream.read(SocketInputStream.java:141)
              at java.io.BufferedInputStream.fill(BufferedInputStream.java:246)
              at java.io.BufferedInputStream.read(BufferedInputStream.java:265)
              at java.io.DataInputStream.readInt(DataInputStream.java:387)
              at org.apache.jute.BinaryInputArchive.readInt(BinaryInputArchive.java:63)
              at org.apache.zookeeper.server.quorum.QuorumPacket.deserialize(QuorumPacket.java:83)
              at org.apache.jute.BinaryInputArchive.readRecord(BinaryInputArchive.java:99)
              at org.apache.zookeeper.server.quorum.LearnerHandler.run(LearnerHandler.java:559)
      2018-05-09 02:29:01,730 [myid:3] - WARN  [LearnerHandler-/192.168.0.91:42490:LearnerHandler@661] - ******* GOODBYE /192.168.0.91:42490 ********

       

      We would expect that zookeeper will choose another Leader and the Kafka cluster will continue to work as expected, but that was not the case.

       

      Attachments

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            Unassigned Unassigned
            oded@coralogix.com Oded

            Dates

              Created:
              Updated:

              Slack

                Issue deployment