Details

    • Type: New Feature
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Fix Version/s: 4.x
    • Component/s: CQL, Materialized Views
    • Labels: None

      Description

      Many common workloads are append-only: that is, they insert new rows but do not update existing ones. However, Cassandra has no way to infer this and so it must treat all tables as if they may experience updates in the future.

      If we added syntax to tell Cassandra about this (WITH INSERTS ONLY, for instance), then we could make a number of optimizations:

      • Compaction would only need to worry about defragmenting partitions, not rows. We could default to DTCS or similar.
      • CollationController could stop scanning sstables as soon as it finds a matching row.
      • Most importantly, materialized views wouldn't need to worry about deleting prior values, which would eliminate the majority of the MV overhead.
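
As a sketch, the proposal might look like the following. Note that the WITH INSERTS ONLY option is purely illustrative proposed syntax from this ticket; nothing here exists in Cassandra:

```sql
-- Hypothetical syntax: declare at table creation that rows are never updated.
CREATE TABLE events (
    id      timeuuid,
    source  text,
    payload blob,
    PRIMARY KEY (id)
) WITH INSERTS ONLY;

-- Under the proposed contract, each primary key is written exactly once;
-- updating an existing row would violate the contract.
INSERT INTO events (id, source, payload) VALUES (now(), 'sensor-1', 0x00);
```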

          Activity

          jbellis Jonathan Ellis added a comment -

          Since INSERT and UPDATE are semantically identical I don't think it's worth disallowing UPDATE on these tables. Instead, we would define our behavior to be: if you violate the INSERTS ONLY contract by updating existing rows, Cassandra will give you one of those versions back when you query it, but not necessarily the most recent. This allows us to preserve our optimizations while doing something reasonable in the face of a broken contract.

          snazy Robert Stupp added a comment -

          IMO it would be logical to disallow UPDATE for WITH INSERTS ONLY tables (and that's what WITH INSERTS ONLY implies).

          Would WITH INSERTS ONLY also mean restricting to primary keys without a clustering key?
          Maybe I didn't completely get it. What I'm thinking about is that one partition can still be split across the memtable + multiple sstables - which would conflict with the compaction/read-path optimizations. For example, if you have a table with PRIMARY KEY ((year, month, day), hour, minute, second) and several million INSERTs per day, it's likely that this will result in multiple sstables per day. I mean - I'm a bit afraid that partitions would have to get too tiny, with all the consequences (too many queries, not being able to insert from different clients for the same day).

          If such a WITH INSERTS ONLY table has no clustering key, even more optimizations might be possible (the key-cache key would not need the sstable ref in the key, but in the value - so we could do the key-cache lookup and skip the bloom-filter lookup on a hit).
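
The time-series shape described above would look like this (illustrative schema; the table and column names are made up for the example):

```sql
-- One partition per day; several million rows per day means the partition
-- will almost certainly be spread across the memtable and multiple sstables.
CREATE TABLE measurements (
    year   int,
    month  int,
    day    int,
    hour   int,
    minute int,
    second int,
    value  double,
    PRIMARY KEY ((year, month, day), hour, minute, second)
);
```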

          jbellis Jonathan Ellis added a comment -

          IMO it would be logical to disallow UPDATE for WITH INSERTS ONLY tables

          I guess I don't care if we disallow UPDATE; my larger point is that the normal semantics of INSERT allow UPDATE-like behavior, so we need to redefine those slightly to allow us to optimize.

          Would WITH INSERTS ONLY mean to also restrict to primary-keys without clustering-key?

          The MV optimization is still useful without that restriction, and you get partial benefits on the others depending on what kinds of read requests are served. So IMO we shouldn't do this for a first cut.

          iamaleksey Aleksey Yeschenko added a comment -

          IMO it would be logical to disallow UPDATE for WITH INSERTS ONLY tables (and that's what with INSERTs only says).

          Would be better to just change the syntax to something like NO OVERWRITES.

          kohlisankalp sankalp kohli added a comment -

          I think NO OVERWRITES or NO UPDATES is easier to understand, since you can overwrite data with INSERTs.
          We should also log errors if, during compaction, we find that data has updates.

          snazy Robert Stupp added a comment -

          Another possible optimization: with append-only tables we wouldn't need to write tombstones for NULL values.

          Yasuharu Yasuharu Goto added a comment -

          How about a WITH INSERTS ONLY option for individual columns?

          In our use case, we have mutable and immutable columns in a table, and we're currently indexing only the immutable columns manually.
          We'd be happy if this optimization could be applied to our app.

          maedhroz Caleb Rackliffe added a comment - edited

          Would it make sense to reject (or at least allow rejection of) an INSERT with only a partial row?

          aweisberg Ariel Weisberg added a comment -

          For tables that are marked append-only, it would be nice to have some best-effort warnings or feedback if updates do occur. Checking the memtable when writing might be cheap/free, and during compaction we can warn and log a conflict if an update is encountered. We could do the same thing on read.

          This would give people with a buggy application (or a bug in Cassandra) rapid feedback rather than silently giving them inconsistent results.

          rustyrazorblade Jon Haddad added a comment -

          This seems like it would cause cluster errors between the point where you altered a table and the point where you pushed new code into production. I'd -1 this based on the volume of problems it's likely to cause for people.

          krummas Marcus Eriksson added a comment -

          during compaction we can warn and log a conflict if an update is encountered.

          Repair would make this harder, since it overstreams - we would end up with the exact same data in several sstables.

          jbellis Jonathan Ellis added a comment -

          Easy enough to ignore merges of the same value. Same problem + solution in the case of timed-out updates that get retried.

          tjake T Jake Luciani added a comment - edited

          if you violate the INSERTS ONLY contract by updating existing rows, Cassandra will give you one of those versions back when you query it, but not necessarily the most recent.

          It sounds like you are saying there are no guarantees.

          I've given this some thought and I think the best approach in which we can syntactically "do something" is to combine this ticket with the idea Tyler Hobbs touched on in CASSANDRA-9928. This might be what you are describing we should do but I'll just restate it.

          One possible solution is to require that all non-PK columns that are in a view PK be updated simultaneously. T Jake Luciani mentioned possible problems from read repair, but it seems like, with this restriction in place, any read repairs would end up repairing all non-PK columns at once.

          Basically, this would add a mode where we only allow INSERTs of all columns every time. While this sounds restrictive, it also forces the user to confront the fact that making updates is conceptually/logistically hard, since we would kick out all client mutations that don't specify all columns. Sure, you could subvert this, but at least the server can alert the user that updating existing data, as they can with other tables, is hard.

          So the proposal is:

          • Add a table-level flag/syntax to mark that a table is INSERT ONLY (which can be altered if there's an emergency).
          • Reject any INSERTs/UPSERTs that do not specify all columns.
          • Possibly always return the earliest row if there is a conflict.
          • When writing to the memtable, we can add a putIfAbsent method to reject/ignore updates (to cover some minimal bases).
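
Under this proposal, the client-visible behavior might look like the following (hypothetical; the INSERT ONLY option and the rejection behavior are this ticket's proposal, not existing Cassandra semantics):

```sql
-- Table marked insert-only (proposed, illustrative syntax):
CREATE TABLE readings (
    id    timeuuid PRIMARY KEY,
    name  text,
    value double
) WITH INSERTS ONLY;

-- Accepted: all columns are specified.
INSERT INTO readings (id, name, value) VALUES (now(), 'temp', 21.5);

-- Rejected under the proposal: partial row (missing 'value').
INSERT INTO readings (id, name) VALUES (now(), 'temp');
```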
          tupshin Tupshin Harper added a comment -

          Basically, we are talking about frozen rows (as an analogy to frozen collections), and I am very much in favor of this. Many use cases, particularly IoT, would be able to use such an optimization while still benefiting from representing data in highly structured columns.

          slebresne Sylvain Lebresne added a comment -

          Shouldn't our first step be to validate that the optimization we have in mind actually makes a meaningful difference (without having to bend a benchmark too hard to show benefits)? It seems clear to me that this will add complexity from the user's point of view (it's a new concept that will either have good footshooting potential (if we just trust the user to insert only, without checking it) or be annoying to use (if we force all columns every time)), so it sounds to me like we would need to demonstrate fairly big performance benefits for this to be worth doing (keep in mind that once we add such a thing, we can't easily remove it, even if the improvement becomes obsolete).

          tl;dr, I don't love the whole idea, as I think it adds complexity from the user's point of view (don't get me wrong, if we could validate this at insert time, I'd be much more of a fan, but we can't), and I'm wondering, given DTCS and other optimizations we have internally, whether this really brings that much to the table.

          tjake T Jake Luciani added a comment -

          The big win is for MVs and probably 2i, since we don't need to read before write anymore.

          slebresne Sylvain Lebresne added a comment -

          The big win is for MV

          Well, that kind of brings up the point that "just" forcing the user to provide all columns actually doesn't work, because you still need to respect timestamps if there are multiple updates (and so "keep" the first hit), and in the case of MVs, you can't assume the update will have a higher timestamp than what's in store.

          So you do need to assume that there are no overwrites, and we can't validate that, which I don't love.

          tjake T Jake Luciani added a comment -

          But we can force the user to always send all the data and block any partial writes/updates.
          I guess my point was that I think that's enough, or the best we can do ATM. The alternative being proposed, I think, was to just state the contract.

          jjirsa Jeff Jirsa added a comment -

          Old stale ticket, but FWIW, re: Sylvain Lebresne's comment on DTCS (/TWCS) -

          This ticket would enable modifying CompactionController.getFullyExpiredSSTables() to be more closely aligned with what people expect it to do, eliminating the need for sstableexpiredblockers and the like. Performance-wise, it's probably a minor gain, but operationally it's probably a big net gain in terms of least surprise.

          michaelsembwever mck added a comment -

          Bumping to fix version 4.x, as 3.11.0 is a bug-fix-only release.
            ref https://s.apache.org/EHBy


            People

            • Assignee: Unassigned
            • Reporter: jbellis Jonathan Ellis
            • Votes: 9
            • Watchers: 40
