Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-20750

Fix TestMiniDruidCliDriver timeouts

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • Tests
    • None

    Description

      Sometimes there seems to be some timeouts; today I was the lucky on getting one...

      I've taken a look and the problem seems to be the following:

      • the maven command executing the tests only runs for ~1 minute - so there is no timeout
      • in the hive log there is an exception about kafka is not able to bind an address:
        org.apache.kafka.common.KafkaException: Socket server failed to bind to localhost:9092: Address already in use.
        	at kafka.network.Acceptor.openServerSocket(SocketServer.scala:442) ~[hive-it-druid-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
        	at kafka.network.Acceptor.<init>(SocketServer.scala:332) ~[hive-it-druid-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
        	at kafka.network.SocketServer$$anonfun$createAcceptorAndProcessors$1.apply(SocketServer.scala:149) ~[hive-it-druid-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
        	at kafka.network.SocketServer$$anonfun$createAcceptorAndProcessors$1.apply(SocketServer.scala:145) ~[hive-it-druid-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
        	at scala.collection.mutable.ResizableArray$class.foreach(ResizableArray.scala:59) ~[hive-it-druid-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
        	at scala.collection.mutable.ArrayBuffer.foreach(ArrayBuffer.scala:48) ~[hive-it-druid-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
        	at kafka.network.SocketServer.createAcceptorAndProcessors(SocketServer.scala:145) ~[hive-it-druid-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
        	at kafka.network.SocketServer.startup(SocketServer.scala:94) ~[hive-it-druid-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
        	at kafka.server.KafkaServer.startup(KafkaServer.scala:250) [hive-it-druid-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
        	at kafka.server.KafkaServerStartable.startup(KafkaServerStartable.scala:38) [hive-it-druid-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
        	at org.apache.hive.kafka.SingleNodeKafkaCluster.serviceStart(SingleNodeKafkaCluster.java:75) [hive-it-druid-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
        	at org.apache.hadoop.service.AbstractService.start(AbstractService.java:194) [hadoop-common-3.1.0.jar:?]
        	at org.apache.hadoop.hive.ql.QTestUtil.setupMiniCluster(QTestUtil.java:636) [hive-it-util-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
        	at org.apache.hadoop.hive.ql.QTestUtil.<init>(QTestUtil.java:509) [hive-it-util-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
        	at org.apache.hadoop.hive.cli.control.CoreCliDriver$1.invokeInternal(CoreCliDriver.java:76) [hive-it-util-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
        	at org.apache.hadoop.hive.cli.control.CoreCliDriver$1.invokeInternal(CoreCliDriver.java:63) [hive-it-util-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
        	at org.apache.hadoop.hive.util.ElapsedTimeLoggingWrapper.invoke(ElapsedTimeLoggingWrapper.java:33) [hive-it-util-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
        	at org.apache.hadoop.hive.cli.control.CoreCliDriver.beforeClass(CoreCliDriver.java:78) [hive-it-util-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
        	at org.apache.hadoop.hive.cli.control.CliAdapter$1$1.evaluate(CliAdapter.java:71) [hive-it-util-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
        	at org.junit.rules.RunRules.evaluate(RunRules.java:20) [junit-4.11.jar:?]
        	at org.junit.runners.ParentRunner.run(ParentRunner.java:309) [junit-4.11.jar:?]
        	at org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:365) [surefire-junit4-2.21.0.jar:2.21.0]
        	at org.apache.maven.surefire.junit4.JUnit4Provider.executeWithRerun(JUnit4Provider.java:273) [surefire-junit4-2.21.0.jar:2.21.0]
        	at org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:238) [surefire-junit4-2.21.0.jar:2.21.0]
        	at org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:159) [surefire-junit4-2.21.0.jar:2.21.0]
        	at org.apache.maven.surefire.booter.ForkedBooter.invokeProviderInSameClassLoader(ForkedBooter.java:379) [surefire-booter-2.21.0.jar:2.21.0]
        	at org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:340) [surefire-booter-2.21.0.jar:2.21.0]
        	at org.apache.maven.surefire.booter.ForkedBooter.execute(ForkedBooter.java:125) [surefire-booter-2.21.0.jar:2.21.0]
        	at org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:413) [surefire-booter-2.21.0.jar:2.21.0]
        Caused by: java.net.BindException: Address already in use
        	at sun.nio.ch.Net.bind0(Native Method) ~[?:1.8.0_102]
        	at sun.nio.ch.Net.bind(Net.java:433) ~[?:1.8.0_102]
        	at sun.nio.ch.Net.bind(Net.java:425) ~[?:1.8.0_102]
        	at sun.nio.ch.ServerSocketChannelImpl.bind(ServerSocketChannelImpl.java:223) ~[?:1.8.0_102]
        	at sun.nio.ch.ServerSocketAdaptor.bind(ServerSocketAdaptor.java:74) ~[?:1.8.0_102]
        	at sun.nio.ch.ServerSocketAdaptor.bind(ServerSocketAdaptor.java:67) ~[?:1.8.0_102]
        	at kafka.network.Acceptor.openServerSocket(SocketServer.scala:438) ~[hive-it-druid-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
        	... 28 more
        

      related ptest batch (will be available for 24h or so):
      http://104.198.109.242/logs/PreCommit-HIVE-Build-14482/failed/196-TestMiniDruidCliDriver-druidmini_masking.q-druidmini_test1.q-druidkafkamini_basic.q-and-2-more/

      Attachments

        Activity

          People

            Unassigned Unassigned
            kgyrtkirk Zoltan Haindrich
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated: