Running nodetool repair repeatedly with the -snapshot parameter results in a native memory leak. The JVM process will take up more and more physical memory until it is killed by the Linux OOM killer.
The command used was as follows:
nodetool repair keyspace -local -snapshot -pr -st start_token -et end_token
Removing the -snapshot flag prevented the memory leak. Subrange repair requires many separate repair invocations, which made the problem noticeable, but I believe it would also be reproducible by running repair repeatedly without specifying a start and end token.
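As a rough illustration of that workaround, here is a minimal Python sketch that splits a token span into subranges and builds one repair command per subrange with -snapshot dropped. The flags are copied from the command above; the helper names, the keyspace name, and the token bounds are hypothetical and would need adjusting for the partitioner in use. It only builds the command lines and does not run them.

```python
def split_token_range(start, end, parts):
    """Split the token span from start to end into `parts` contiguous subranges."""
    width = end - start
    if parts < 1 or width < parts:
        raise ValueError("need 1 <= parts <= (end - start)")
    # Integer arithmetic keeps the bounds exact even for very large tokens.
    bounds = [start + (width * i) // parts for i in range(parts + 1)]
    return list(zip(bounds[:-1], bounds[1:]))


def repair_commands(keyspace, start, end, parts):
    """Build one nodetool argument list per subrange, without the -snapshot flag."""
    return [
        ["nodetool", "repair", keyspace, "-local", "-pr", "-st", str(st), "-et", str(et)]
        for st, et in split_token_range(start, end, parts)
    ]


if __name__ == "__main__":
    # Hypothetical keyspace name and token bounds, for illustration only.
    for cmd in repair_commands("my_keyspace", 0, 2**127 - 1, 4):
        print(" ".join(cmd))
```

Each returned list could be handed to a process runner one at a time, so that only one subrange repair is in flight at once.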
Notes from Yuki Morishita:
The cause is probably too many snapshots. Snapshot sstables are opened during validation, and the memory they use is normally freed when releaseReferences is called. But since snapshots never get marked compacted, that memory is never freed.
We only clean up mmap'd memory when an sstable is marked compacted: https://github.com/apache/cassandra/blob/cassandra-1.2/src/java/org/apache/cassandra/io/sstable/SSTableReader.java#L974
Validation compaction never marks snapshots compacted.
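The condition described in these notes can be modeled with a small Python sketch. This is a toy model under stated assumptions, not Cassandra's Java code: the class, its methods, and the byte counts are hypothetical and only mirror the rule that cleanup runs when the last reference is released and the reader has been marked compacted.

```python
class SnapshotReaderModel:
    """Toy stand-in for an sstable reader; not Cassandra's actual class."""

    def __init__(self, mapped_bytes):
        self.mapped_bytes = mapped_bytes  # pretend size of the mmap'd region
        self.references = 1               # reference held by whoever opened it
        self.compacted = False
        self.unmapped = False

    def mark_compacted(self):
        self.compacted = True

    def release_reference(self):
        self.references -= 1
        # Cleanup runs only when the last reference is gone AND the reader
        # was marked compacted, as described in the notes above.
        if self.references == 0 and self.compacted:
            self.unmapped = True


def simulate_validations(count, mapped_bytes, mark_compacted):
    """Open and release `count` snapshot readers; return bytes left mapped."""
    readers = []
    for _ in range(count):
        reader = SnapshotReaderModel(mapped_bytes)
        if mark_compacted:
            reader.mark_compacted()
        reader.release_reference()
        readers.append(reader)
    return sum(r.mapped_bytes for r in readers if not r.unmapped)


if __name__ == "__main__":
    print(simulate_validations(10, 1024, mark_compacted=False))
    print(simulate_validations(10, 1024, mark_compacted=True))
```

In the model, readers that are released without ever being marked compacted stay mapped, so the total grows with every validation. The `mark_compacted=True` case is included only as a contrast and is not meant to represent the actual patch.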
|Field||Original Value||New Value|
|Assignee|| ||Yuki Morishita [ yukim ]|
|Fix Version/s|| ||1.2.10 [ 12324948 ]|
|Priority||Major [ 3 ]||Minor [ 4 ]|
|Component/s|| ||Core [ 12312978 ]|
|Status||Open [ 1 ]||Patch Available [ 10002 ]|
|Fix Version/s|| ||2.0.1 [ 12324542 ]|
|Resolution|| ||Fixed [ 1 ]|
|Status||Patch Available [ 10002 ]||Resolved [ 5 ]|
|Transition||Time In Source Status||Execution Times||Last Executer||Last Execution Date|
| ||18h 17m||1||Yuki Morishita||18/Sep/13 17:05|
| ||3h 21m||1||Yuki Morishita||18/Sep/13 20:27|