Details
- Type: Bug
- Status: Resolved
- Priority: Major
- Resolution: Fixed
Description
Currently, containers are marked as lost after a couple of minutes, which is too aggressive for a busy cluster. We should increase the default timeouts and make the container timeout configurable. We may also want to increase the number of times the agent retries heartbeating to the AM.
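As a rough illustration of the proposal, a configurable loss check might look like the sketch below. The class name `HeartbeatLossPolicy`, the `fromMillisOrDefault` helper, and the 10-minute default are all hypothetical and are not taken from the Slider code base; the actual fix may be structured quite differently.

```java
import java.time.Duration;

/** Hypothetical sketch: decides whether a container should be treated as lost. */
class HeartbeatLossPolicy {
    // Assumed default for illustration only; not a value taken from Slider.
    static final Duration DEFAULT_TIMEOUT = Duration.ofMinutes(10);

    private final Duration timeout;

    HeartbeatLossPolicy(Duration timeout) {
        if (timeout == null || timeout.isNegative() || timeout.isZero()) {
            throw new IllegalArgumentException("timeout must be positive");
        }
        this.timeout = timeout;
    }

    /** Uses the configured value when present, otherwise the assumed default. */
    static HeartbeatLossPolicy fromMillisOrDefault(Long configuredMillis) {
        return configuredMillis == null
                ? new HeartbeatLossPolicy(DEFAULT_TIMEOUT)
                : new HeartbeatLossPolicy(Duration.ofMillis(configuredMillis));
    }

    /** A container is lost only once the gap since its last heartbeat exceeds the timeout. */
    boolean isLost(long lastHeartbeatMillis, long nowMillis) {
        return nowMillis - lastHeartbeatMillis > timeout.toMillis();
    }
}
```

With this shape, a gap of a couple of minutes no longer marks a container as lost under the assumed default, while an operator could still tighten or loosen the window through configuration.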
Attachments
Issue Links
- breaks: SLIDER-1206 AgentFailuresIT and AgentFailures2IT failing due to increase in heartbeat loss interval (Resolved)