Details
- Type: New Feature
- Status: Resolved
- Priority: Major
- Resolution: Duplicate
Description
Currently we have two log retention strategies: one based on time and one based on log size. These work well for "event"-type data, i.e. data that consists only of appends. However, if the data model changes to an underlying keyed data set, a more convenient retention strategy would be to delete keys that have been overwritten rather than retaining whole segments.
The proposed implementation would be a background process that scans log segments and recopies only the keys that have not been overwritten. More details are in this wiki:
https://cwiki.apache.org/confluence/display/KAFKA/Keyed+Messages+Proposal
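As a rough illustration of the scan-and-recopy idea, the sketch below keeps only the latest record per key from an in-memory segment. This is a minimal sketch, not Kafka's implementation: the LogRecord type, its fields, and the compact method are hypothetical, and a real compaction pass would stream segments from disk rather than hold them in a list.

{code:java}
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class CompactionSketch {

    /** Hypothetical log record: a key, a value, and the record's offset. */
    record LogRecord(String key, byte[] value, long offset) {}

    /**
     * Scan records in offset order and keep only the last record seen
     * for each key, i.e. drop every record that was later overwritten.
     */
    static List<LogRecord> compact(List<LogRecord> segment) {
        Map<String, LogRecord> latestByKey = new LinkedHashMap<>();
        for (LogRecord r : segment) {
            // Remove before put so the surviving record takes its own
            // position in offset order rather than the old record's slot.
            latestByKey.remove(r.key());
            latestByKey.put(r.key(), r);
        }
        return List.copyOf(latestByKey.values());
    }
}
{code}

The LinkedHashMap preserves the relative offset order of the surviving records, mirroring how a recopied segment would keep retained messages in their original order.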
Issue Links
- duplicates KAFKA-631 Implement log compaction (Resolved)