I've deployed cluster with 3000 agents and started my little test script on it. Script is doing only three actions in loop: 1) Stop All services 2) Start All services 3) Update zk config.
After few days of work these actions need much more time to be executed. For example before stop/start all actions took near 7-8 minutes. Now they need near 20-30 minutes