Uploaded image for project: 'Cassandra'
  1. Cassandra
  2. CASSANDRA-14679

Prevent generating new tokens on a node when data exists

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Urgent
    • Resolution: Unresolved
    • None
    • None
    • Critical

    Description

      Data loss is possible if a node starts up without system.local data available.

      If a node restarts and its system.local data is unavailable it will generate new tokens. This will cause range movements in the cluster causing potential data loss, as these range movements are not part of a bootstrap/decommission and leaves orphaned data around the cluster.

      This can happen if a node restarts without a JBOD entry available, or if the cassandra.yaml changes and leaves a JBOD entry out.

      If a node starts up, finds data but not its system.local it should not generate new tokens. Neither should it assign itself a new Host ID.

      This is described in more detail in http://thelastpickle.com/blog/2018/08/22/the-fine-print-when-using-multiple-data-directories.html

      Attachments

        Activity

          People

            Unassigned Unassigned
            mck Michael Semb Wever
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated: