Details
- Improvement
- Status: Open
- Normal
- Resolution: Unresolved
Description
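A cluster snapshot contains RF copies of each piece of data, one per replica, and sstableloader streams whatever it is given to every replica that owns that data. Bulk loading all of those snapshots therefore creates RF^2 copies of each piece of data on the target cluster.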
It is not clear what the right solution is. Ideally, we would merge the RF copies of the data before sending them to the cluster, which would also resolve any inconsistencies that existed when the snapshot was taken.
A more naive approach, loading only one of the RF copies and assuming there are no inconsistencies, might be an easier near-term goal.
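To make the two options concrete, here is a minimal Python sketch. It is not Cassandra code: the in-memory snapshot model, the `Cell` type, and the function names `copies_after_bulkload` and `merge_replicas` are all hypothetical, and it assumes the target cluster uses the same RF as the source. The merge uses a simplified last-write-wins rule on the write timestamp and ignores tombstones and Cassandra's tie-breaking on equal timestamps.

```python
from typing import Dict, List, Tuple

# Hypothetical model: a replica snapshot maps key -> (value, write_timestamp).
Cell = Tuple[str, int]
Snapshot = Dict[str, Cell]


def copies_after_bulkload(rf: int, snapshots_loaded: int) -> int:
    """Copies of one piece of data on the target cluster.

    Assumes each loaded copy is streamed to all rf replicas that own it.
    """
    return snapshots_loaded * rf


def merge_replicas(snapshots: List[Snapshot]) -> Snapshot:
    """Merge the RF copies, keeping the cell with the highest timestamp per key.

    Simplified last-write-wins; on equal timestamps the first copy seen is kept.
    """
    merged: Snapshot = {}
    for snapshot in snapshots:
        for key, (value, timestamp) in snapshot.items():
            if key not in merged or timestamp > merged[key][1]:
                merged[key] = (value, timestamp)
    return merged


if __name__ == "__main__":
    rf = 3
    # Loading every replica's snapshot: RF copies in, each streamed RF times.
    print(copies_after_bulkload(rf, snapshots_loaded=rf))  # 9
    # Loading a single (merged or naively chosen) copy.
    print(copies_after_bulkload(rf, snapshots_loaded=1))   # 3

    # Three replica snapshots; the second one missed the latest write to "k1".
    replicas = [
        {"k1": ("new", 20), "k2": ("x", 5)},
        {"k1": ("old", 10), "k2": ("x", 5)},
        {"k1": ("new", 20)},
    ]
    print(merge_replicas(replicas))
```

Under these assumptions, merging first keeps the newest value for every key even if one replica was stale, whereas the naive approach would load whatever a single replica happened to hold (for example, the stale `"old"` value if the second snapshot were chosen).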
Issue Links
- is duplicated by: CASSANDRA-10757 Cluster migration with sstableloader requires significant compaction time (Resolved)
- is related to: CASSANDRA-6448 Give option to stream just primary replica via sstableloader (Open)