Uploaded image for project: 'Cassandra'
  1. Cassandra
  2. CASSANDRA-8245

Cassandra nodes periodically die in 2-DC configuration

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Low
    • Resolution: Fixed
    • 2.0.12, 2.1.3
    • None
    • None
    • Scientific Linux release 6.5
      java version "1.7.0_51"
      Cassandra 2.0.9

    • Low

    Description

      We have 2 DCs with 3 nodes in each.
      Second DC periodically has 1-2 nodes down.

      Looks like it looses connectivity with another nodes and then Gossiper starts to accumulate tasks until Cassandra dies with OOM.

      WARN [MemoryMeter:1] 2014-08-12 14:34:59,803 Memtable.java (line 470) setting live ratio to maximum of 64.0 instead of Infinity
      WARN [GossipTasks:1] 2014-08-12 14:44:34,866 Gossiper.java (line 637) Gossip stage has 1 pending tasks; skipping status check (no nodes will be marked down)
      WARN [GossipTasks:1] 2014-08-12 14:44:35,968 Gossiper.java (line 637) Gossip stage has 4 pending tasks; skipping status check (no nodes will be marked down)
      WARN [GossipTasks:1] 2014-08-12 14:44:37,070 Gossiper.java (line 637) Gossip stage has 8 pending tasks; skipping status check (no nodes will be marked down)
      WARN [GossipTasks:1] 2014-08-12 14:44:38,171 Gossiper.java (line 637) Gossip stage has 11 pending tasks; skipping status check (no nodes will be marked down)
      ...
      WARN [GossipTasks:1] 2014-10-06 21:42:51,575 Gossiper.java (line 637) Gossip stage has 1014764 pending tasks; skipping status check (no nodes will be marked down)
      WARN New I/O worker #13 2014-10-06 21:54:27,010 Slf4JLogger.java (line 76) Unexpected exception in the selector loop.
      java.lang.OutOfMemoryError: Java heap space

      Also those lines but not sure it is relevant:
      DEBUG [GossipStage:1] 2014-08-12 11:33:18,801 FailureDetector.java (line 338) Ignoring interval time of 2085963047

      Attachments

        1. stack1.txt
          188 kB
          Oleg Poleshuk
        2. stack2.txt
          187 kB
          Oleg Poleshuk
        3. stack3.txt
          195 kB
          Oleg Poleshuk
        4. stack4.txt
          189 kB
          Oleg Poleshuk
        5. stack5.txt
          185 kB
          Oleg Poleshuk

        Activity

          People

            brandon.williams Brandon Williams
            relgames Oleg Poleshuk
            Brandon Williams
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: