Uploaded image for project: 'Kafka'
  1. Kafka
  2. KAFKA-1907

ZkClient can block controlled shutdown indefinitely

    XMLWordPrintableJSON

Details

    Description

      There are some calls to ZkClient via ZkUtils in KafkaServer.controlledShutdown() that can block indefinitely because they internally call waitUntilConnected. The ZkClient API doesn't provide an alternative with timeouts, so fixing this will require enforcing timeouts in some other way.

      This may be a more general issue if there are any non daemon threads that also call ZkUtils methods.

      Stacktrace showing the issue:

      "Thread-2" prio=10 tid=0xb3305000 nid=0x4758 waiting on condition [0x6ad69000]
         java.lang.Thread.State: TIMED_WAITING (parking)
          at sun.misc.Unsafe.park(Native Method)
          - parking to wait for  <0x70a93368> (a java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject)
          at java.util.concurrent.locks.LockSupport.parkUntil(LockSupport.java:267)
          at java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.awaitUntil(AbstractQueuedSynchronizer.java:2130)
          at org.I0Itec.zkclient.ZkClient.waitForKeeperState(ZkClient.java:636)
          at org.I0Itec.zkclient.ZkClient.waitUntilConnected(ZkClient.java:619)
          at org.I0Itec.zkclient.ZkClient.waitUntilConnected(ZkClient.java:615)
          at org.I0Itec.zkclient.ZkClient.retryUntilConnected(ZkClient.java:679)
          at org.I0Itec.zkclient.ZkClient.readData(ZkClient.java:766)
          at org.I0Itec.zkclient.ZkClient.readData(ZkClient.java:761)
          at kafka.utils.ZkUtils$.readDataMaybeNull(ZkUtils.scala:456)
          at kafka.utils.ZkUtils$.getController(ZkUtils.scala:65)
          at kafka.server.KafkaServer.kafka$server$KafkaServer$$controlledShutdown(KafkaServer.scala:194)
          at kafka.server.KafkaServer$$anonfun$shutdown$1.apply$mcV$sp(KafkaServer.scala:269)
          at kafka.utils.Utils$.swallow(Utils.scala:172)
          at kafka.utils.Logging$class.swallowWarn(Logging.scala:92)
          at kafka.utils.Utils$.swallowWarn(Utils.scala:45)
          at kafka.utils.Logging$class.swallow(Logging.scala:94)
          at kafka.utils.Utils$.swallow(Utils.scala:45)
          at kafka.server.KafkaServer.shutdown(KafkaServer.scala:269)
          at kafka.server.KafkaServerStartable.shutdown(KafkaServerStartable.scala:42)
          at kafka.Kafka$$anon$1.run(Kafka.scala:42)
      

      Attachments

        1. KAFKA-1907.patch
          18 kB
          Jaikiran Pai
        2. KAFKA-1907.patch
          5 kB
          Jaikiran Pai
        3. KAFKA-1907_2015-05-25_09:18:14.patch
          5 kB
          Jaikiran Pai

        Activity

          People

            jaikiran Jaikiran Pai
            ewencp Ewen Cheslack-Postava
            Jun Rao Jun Rao
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated: