Uploaded image for project: 'Cassandra'
  1. Cassandra
  2. CASSANDRA-9702

Repair running really slow

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Normal
    • Resolution: Cannot Reproduce
    • None
    • None
    • C* 2.1.7, Debian Wheezy

    • Normal

    Description

      We're using 2.1.x since the very beginning and we always had problem with failing or slow repair. In one data center we aren't able to finish repair for many weeks (partially because CASSANDRA-9681 as we needed to reboot nodes periodically).

      I've launched it today morning (12 hours now) and monitor using https://github.com/spotify/cassandra-opstools/blob/master/bin/spcassandra-repairstats. For the first hour it progressed to 9.43% but then it took ~10 hours to reach 9.44%. I see very rarely logs related to repair (each 15-20 minutes but sometimes nothing new for 1 hour).

      Repair launched with:

      nodetool repair --partitioner-range --parallel --in-local-dc {keyspace}
      

      Attached log file from today.

      We've ~4.1TB of data in 12 nodes with RF set to 3 (2 DC with 6 nodes each).

      Attachments

        1. db1.system.log
          8.42 MB
          mlowicki

        Activity

          People

            Unassigned Unassigned
            mlowicki mlowicki
            Votes:
            2 Vote for this issue
            Watchers:
            6 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: