Cassandra / CASSANDRA-4756

Bulk loading snapshots creates RF^2 copies of the data

Details

    • Type: Improvement
    • Status: Open
    • Priority: Normal
    • Resolution: Unresolved
    • Fix Version/s: None
    • Component/s: Legacy/Core
    • Labels: None

    Description

      Since a cluster snapshot contains RF copies of each piece of data (one per replica), and bulk loading streams each loaded copy to all RF replicas again, bulk loading all of those snapshots creates RF^2 copies of each piece of data.
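
      To make the arithmetic concrete, here is a toy model of the effect. It is only a sketch and not Cassandra code: the ring placement (`key % num_nodes` plus the next nodes) and the function names are assumptions made for illustration, and it counts copies written before any compaction could merge them.

```python
from collections import Counter

def replicas_for(key, num_nodes, rf):
    # Toy placement (assumption): the primary node is key % num_nodes,
    # followed by the next rf - 1 nodes around the ring.
    return [(key + i) % num_nodes for i in range(rf)]

def copies_after_bulk_load(num_nodes, rf, num_keys):
    # Each node's snapshot holds every key that node replicates.
    snapshots = {node: [] for node in range(num_nodes)}
    for key in range(num_keys):
        for node in replicas_for(key, num_nodes, rf):
            snapshots[node].append(key)

    # Loading a snapshot sends each of its keys to all rf replicas.
    written = Counter()
    for keys in snapshots.values():
        for key in keys:
            written[key] += len(replicas_for(key, num_nodes, rf))
    return written

copies = copies_after_bulk_load(num_nodes=6, rf=3, num_keys=12)
print(set(copies.values()))  # {9}, i.e. RF^2 for RF = 3
```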

      I am not sure what the right solution is. Ideally, we would merge the RF copies of the data before sending them to the cluster, which would also resolve any inconsistencies that existed when the snapshot was taken.
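
      As a rough illustration of what merging the RF copies could look like, the sketch below keeps, for each cell, the version with the highest write timestamp. The dict-based data layout and the name `merge_replica_copies` are hypothetical, and the sketch deliberately ignores tombstones, TTLs, and timestamp ties, all of which a real merge would have to handle.

```python
def merge_replica_copies(copies):
    """Merge replica copies cell by cell, keeping the newest write.

    Each copy maps (row_key, column) -> (timestamp, value).
    This layout is an assumption made for the example.
    """
    merged = {}
    for copy in copies:
        for cell_key, (timestamp, value) in copy.items():
            current = merged.get(cell_key)
            # Keep the cell only if it is new or strictly newer.
            if current is None or timestamp > current[0]:
                merged[cell_key] = (timestamp, value)
    return merged

# Two replicas disagree on k1: the second one saw a later write.
replica_a = {("k1", "name"): (1, "old")}
replica_b = {("k1", "name"): (2, "new"), ("k2", "name"): (1, "x")}
merged = merge_replica_copies([replica_a, replica_b])
```

      With the merged result, each cell would be sent to the cluster once, so the load would produce RF copies rather than RF^2.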

      A more naive approach, loading only one of the RF copies and assuming there are no inconsistencies, might be an easier near-term goal.
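
      One way the naive approach might be sketched: from each node's snapshot, load only the rows for which that node is the primary replica, so every row is loaded exactly once. The function names and the integer tokens are assumptions for illustration; the only ring rule relied on is that a node owns the range from the previous node's token (exclusive) up to its own token (inclusive).

```python
from bisect import bisect_left

def primary_owner(token, ring_tokens):
    # ring_tokens is the sorted list of node tokens. The primary owner is
    # the first node whose token is >= the row token, wrapping to node 0.
    return bisect_left(ring_tokens, token) % len(ring_tokens)

def rows_to_load(node_index, snapshot_tokens, ring_tokens):
    # Keep only rows this node owns as primary and skip the rows it
    # merely holds as a replica for another node's range.
    return [t for t in snapshot_tokens
            if primary_owner(t, ring_tokens) == node_index]

ring = [0, 100, 200]
# Node 1 (token 100) holds its own range plus replicated rows from node 0.
node1_snapshot = [50, 100, 250, 0]
selected = rows_to_load(1, node1_snapshot, ring)  # [50, 100]
```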

            People

              Assignee: Unassigned
              Reporter: Nick Bailey (nickmbailey)
              Votes: 2
              Watchers: 10
