Details
-
Bug
-
Status: Resolved
-
Critical
-
Resolution: Fixed
-
Private Beta
-
None
Description
Looked a bit at the tablet servers in the YCSB cluster today, after the cluster had hit about 1TB of data each (about 789 tablets per server). YCSB is getting slower and slower, apparently because a lot of RAM is being used up by tables that were written days ago. Looking at the maintenance manager dashboard, we are prioritizing compactions, and never flushing MRS even from tablets that haven't taken an insert in days.
I think once the total number of tablets grows a bit more, we'll be in a situation where we won't flush them, but we're basically out of RAM, and everything will grind to a halt.