CASSANDRA-16446

Parent repair sessions leak may lead to long node pauses

Details

    Description

      ActiveRepairService keeps a map, `parentRepairSessions`. If these sessions leak, the map can grow large enough that, when a node restarts, the cleanup of sessions triggered by ActiveRepairService.onRestart() pauses nodes in the cluster for a long time.

      The proposed solution is for repairs to clean up these sessions on all nodes on completion by sending a CLEANUP message to the involved nodes. Tests rely on a new parentRepairSessionsCount() method on the parent repair sessions MBean to track how many sessions remain.
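
      The cleanup idea described above can be sketched in isolation. This is a minimal illustration only, not Cassandra's code: the classes `RepairNodeSketch` and `RepairCoordinatorSketch`, and the methods `registerParentSession`, `handleCleanup` and `runRepair`, are hypothetical names invented for this example. Only `parentRepairSessions` and `parentRepairSessionsCount()` echo names from the description, and the direct method call stands in for sending a CLEANUP message over the network.

```java
import java.util.List;
import java.util.Map;
import java.util.UUID;
import java.util.concurrent.ConcurrentHashMap;

/** Hypothetical stand-in for one node's repair service (not ActiveRepairService). */
final class RepairNodeSketch {
    // Parent session id -> session description; this is the map that must not leak.
    private final Map<UUID, String> parentRepairSessions = new ConcurrentHashMap<>();

    void registerParentSession(UUID sessionId, String description) {
        parentRepairSessions.put(sessionId, description);
    }

    // Assumed handler for a CLEANUP message; removing an unknown id is a no-op.
    void handleCleanup(UUID sessionId) {
        parentRepairSessions.remove(sessionId);
    }

    // Counter of the kind a test could poll to detect leaked sessions.
    int parentRepairSessionsCount() {
        return parentRepairSessions.size();
    }
}

/** Hypothetical coordinator that always cleans up on every participant. */
final class RepairCoordinatorSketch {
    static void runRepair(UUID sessionId, List<RepairNodeSketch> participants, Runnable repairWork) {
        for (RepairNodeSketch node : participants) {
            node.registerParentSession(sessionId, "repair " + sessionId);
        }
        try {
            repairWork.run();
        } finally {
            // Runs on success and on failure, so no participant keeps the session.
            for (RepairNodeSketch node : participants) {
                node.handleCleanup(sessionId);
            }
        }
    }
}
```

      Placing the cleanup in a `finally` block is one way to make sure a failed repair does not leave entries behind; the actual patch may handle failures differently.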

            People

              bereng Berenguer Blasi
              bereng Berenguer Blasi
              Berenguer Blasi
              Andres de la Peña, Ekaterina Dimitrova
              Votes: 0
              Watchers: 6

                Time Tracking

                  Estimated: Not Specified
                  Remaining: 0h
                  Logged: 1h