Uploaded image for project: 'Cassandra'
  1. Cassandra
  2. CASSANDRA-18555

Expose decommission state to nodetool info

    XMLWordPrintableJSON

Details

    Description

      Currently, when a node is being decommissioned and if any failure happens, then an exception is thrown back to the caller.

      But Cassandra's decommission takes considerable time ranging from minutes to hours to days. There are various scenarios in that the caller may need to probe the status again:

      • The caller times out
      • It is not possible to keep the caller hanging for such a long time

      And If the caller does not know what happened internally, then it cannot retry, etc., leading to other issues.

      So, in this ticket, I am going to add a new nodetool/JMX command that can be invoked by the caller anytime, and it will return the correct status.

      It might look like a smaller change, but when we need to operate Cassandra at scale in a large-scale fleet, then this becomes a bottleneck and require constant operator intervention.

      Attachments

        Issue Links

          Activity

            People

              chovatia.jaydeep@gmail.com Jaydeepkumar Chovatia
              chovatia.jaydeep@gmail.com Jaydeepkumar Chovatia
              Jaydeepkumar Chovatia, Stefan Miklosovic
              Brandon Williams, Maxwell Guo
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 7h
                  7h