Cassandra / CASSANDRA-7560

'nodetool repair -pr' leads to indefinitely hanging AntiEntropySession

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Normal
    • Resolution: Fixed
    • Fix Version/s: 2.0.10
    • None
    • None
    • Normal

    Description

      Running nodetool repair -pr will sometimes hang on one of the resulting AntiEntropySessions.

      The system logs show the repair command starting:

       INFO [Thread-3079] 2014-07-15 02:22:56,514 StorageService.java (line 2569) Starting repair command #1, repairing 256 ranges for keyspace x
      

      You can then see a few AntiEntropySessions completing with:

      INFO [AntiEntropySessions:2] 2014-07-15 02:28:12,766 RepairSession.java (line 282) [repair #eefb3c30-0bc6-11e4-83f7-a378978d0c49] session completed successfully
      

      Eventually, one AntiEntropySession hangs just before requesting the Merkle trees for the next column family in line for repair. The log shows the previous CF finishing, and then the whole repair session hangs with no visible progress or errors on this node or any of the related nodes:

      INFO [AntiEntropyStage:1] 2014-07-15 02:38:20,325 RepairSession.java (line 221) [repair #8f85c1b0-0bc8-11e4-83f7-a378978d0c49] previous_cf is fully synced
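
      One way to spot such a session in a large log is to group lines by repair session ID and list the sessions that never log "session completed successfully". The following is only a minimal sketch: the line pattern is inferred from the excerpts in this report, and the names LINE_RE and find_open_sessions are hypothetical helpers, not part of Cassandra or nodetool. Any session without the completion line is reported, whether it is hung or simply still running.

```python
import re
from datetime import datetime

# Line layout assumed from the excerpts in this report:
#   LEVEL [thread] YYYY-MM-DD HH:MM:SS,mmm File.java (line N) [repair #<id>] message
LINE_RE = re.compile(
    r"^\s*\w+\s+\[[^\]]+\]\s+"
    r"(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3})\s+"
    r".*?\[repair #(?P<sid>[0-9a-f-]+)\]\s+(?P<msg>.*)$"
)

COMPLETED_MARKER = "session completed successfully"


def find_open_sessions(lines):
    """Return {session_id: (last_timestamp, last_message)} for every repair
    session that has no completion line in the given log lines."""
    last_seen = {}
    completed = set()
    for line in lines:
        match = LINE_RE.match(line)
        if match is None:
            continue  # not a per-session repair line
        sid = match["sid"]
        ts = datetime.strptime(match["ts"], "%Y-%m-%d %H:%M:%S,%f")
        msg = match["msg"].strip()
        last_seen[sid] = (ts, msg)  # remember the most recent line per session
        if COMPLETED_MARKER in msg:
            completed.add(sid)
    return {sid: seen for sid, seen in last_seen.items() if sid not in completed}
```

      Passing the lines of a node's system log to find_open_sessions returns, for each open session, the timestamp and text of its last log line. In the case described here, that last line would be the "is fully synced" message for the previous column family.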
      

      Notes:

      • Single-DC, 6-node cluster with an average load of 86 GB per node.
      • This appears to be random; it does not always happen on the same CF or on the same session.

      Attachments

        1. nodetool_command.log
          7 kB
          Vladimir Avram
        2. cassandra_daemon.log
          309 kB
          Vladimir Avram
        3. cassandra_daemon_rep2.log
          285 kB
          Vladimir Avram
        4. cassandra_daemon_rep1.log
          312 kB
          Vladimir Avram
        5. 0001-partial-backport-3569.patch
          5 kB
          Yuki Morishita
        6. 0001-backport-CASSANDRA-6747.patch
          13 kB
          Yuki Morishita

            People

              Assignee: Yuki Morishita (yukim)
              Reporter: Vladimir Avram (vladmore)
              Authors: Yuki Morishita
              Reviewers: Joshua McKenzie
              Votes: 1
              Watchers: 13
