Apache Cassandra / CASSANDRA-8072

Exception during startup: Unable to gossip with any seeds


Details

    • Normal

    Description

      When OpsCenter 4.1.4 or 5.0.1 provisions a 2-node DSC 2.0.10 cluster, either in EC2 or locally, one of the nodes sometimes refuses to start C*. The error in /var/log/cassandra/system.log is:

      ERROR [main] 2014-10-06 15:54:52,292 CassandraDaemon.java (line 513) Exception encountered during startup
      java.lang.RuntimeException: Unable to gossip with any seeds
      at org.apache.cassandra.gms.Gossiper.doShadowRound(Gossiper.java:1200)
      at org.apache.cassandra.service.StorageService.checkForEndpointCollision(StorageService.java:444)
      at org.apache.cassandra.service.StorageService.prepareToJoin(StorageService.java:655)
      at org.apache.cassandra.service.StorageService.initServer(StorageService.java:609)
      at org.apache.cassandra.service.StorageService.initServer(StorageService.java:502)
      at org.apache.cassandra.service.CassandraDaemon.setup(CassandraDaemon.java:378)
      at org.apache.cassandra.service.CassandraDaemon.activate(CassandraDaemon.java:496)
      at org.apache.cassandra.service.CassandraDaemon.main(CassandraDaemon.java:585)
      INFO [StorageServiceShutdownHook] 2014-10-06 15:54:52,326 Gossiper.java (line 1279) Announcing shutdown
      INFO [StorageServiceShutdownHook] 2014-10-06 15:54:54,326 MessagingService.java (line 701) Waiting for messaging service to quiesce
      INFO [ACCEPT-localhost/127.0.0.1] 2014-10-06 15:54:54,327 MessagingService.java (line 941) MessagingService has terminated the accept() thread

      This error does not occur every time a 2-node cluster is provisioned; it shows up probably around half of the time, and only on one of the nodes. I haven't been able to reproduce it with DSC 2.0.9, and there have been no code or definition file changes in OpsCenter.

      I can reproduce this locally with the above steps. I'm happy to test any proposed fixes, since I'm the only person able to reproduce it reliably so far.
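
      Because the failure is intermittent, it can help to scan system.log after each provisioning run and record whether the exception appeared. The following is a minimal sketch, not part of the original report: the function name find_seed_gossip_failures is hypothetical, and it assumes the log line layout seen in the excerpt above (level, [thread], timestamp, message).

```python
import re
from typing import Iterable, List

# Header lines in the format shown in the excerpt, e.g.
# "ERROR [main] 2014-10-06 15:54:52,292 CassandraDaemon.java (line 513) ..."
LOG_LINE = re.compile(
    r"^\s*(?P<level>[A-Z]+)\s+\[(?P<thread>[^\]]+)\]\s+"
    r"(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3})\s+(?P<msg>.*)$"
)
SEED_FAILURE = "Unable to gossip with any seeds"


def find_seed_gossip_failures(lines: Iterable[str]) -> List[str]:
    """Return the timestamp of every ERROR entry that reports the seed gossip failure."""
    failures: List[str] = []
    pending_error_ts = None  # timestamp of the most recent ERROR header, if any
    for line in lines:
        header = LOG_LINE.match(line)
        if header:
            if header.group("level") != "ERROR":
                pending_error_ts = None
            elif SEED_FAILURE in header.group("msg"):
                # Message and exception text on the same line.
                failures.append(header.group("ts"))
                pending_error_ts = None
            else:
                pending_error_ts = header.group("ts")
        elif SEED_FAILURE in line and pending_error_ts is not None:
            # Exception text on a continuation line under an ERROR header.
            failures.append(pending_error_ts)
            pending_error_ts = None
    return failures
```

      To use it, open /var/log/cassandra/system.log on each node and pass the file object to find_seed_gossip_failures; an empty list means the node did not hit this startup failure in that log.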

      Attachments

        1. casandra-system-log-with-assert-patch.log
          11 kB
          Ryan Springer
        2. trace_logs.tar.bz2
          76 kB
          John Alberts
        3. cas-dev-dt-01-uw1-cassandra-seed01_logs.tar.bz2
          481 kB
          John Alberts
        4. cas-dev-dt-01-uw1-cassandra-seed02_logs.tar.bz2
          508 kB
          John Alberts
        5. cas-dev-dt-01-uw1-cassandra02_logs.tar.bz2
          94 kB
          John Alberts
        6. screenshot-1.png
          216 kB
          Steven Lowenthal


          People

            Stefania Alborghetti (stefania)
            Ryan Springer (respringer)
            Stefania Alborghetti
            Sam Tunnicliffe
            Russ Hatch
            Votes: 3
            Watchers: 17
