Details
Type: Improvement
Status: Resolved
Priority: Major
Resolution: Fixed
Description
Each host in the cluster runs ambari-agent.
There should be a Nagios alert that watches the ambari-agent process. Since the system does not allow direct communication with an ambari-agent, this check should either a) check the process running on the host or b) query the Ambari Server REST API to confirm the agent is still heartbeating.
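A minimal sketch of what the two options could look like as Nagios plugin logic, in Python. It is not the implementation from this ticket. The pid file path, the function names, and the 120-second staleness threshold are assumptions made for illustration. For option (b), the heartbeat timestamp is taken as an argument, and fetching it from the Ambari Server REST API is left out because the exact endpoint and field are not specified here.

```python
import os

# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3

# Assumed pid file location; adjust to the actual agent install.
DEFAULT_PID_FILE = "/var/run/ambari-agent/ambari-agent.pid"


def check_agent_process(pid_file=DEFAULT_PID_FILE):
    """Option (a): return (exit_code, message) based on the agent's pid file."""
    try:
        with open(pid_file) as handle:
            pid = int(handle.read().strip())
        if pid <= 0:
            raise ValueError("invalid pid %d" % pid)
    except (OSError, ValueError) as exc:
        return CRITICAL, "CRITICAL - cannot read pid file %s: %s" % (pid_file, exc)

    try:
        os.kill(pid, 0)  # signal 0 only probes whether the process exists
    except ProcessLookupError:
        return CRITICAL, "CRITICAL - ambari-agent not running (stale pid %d)" % pid
    except PermissionError:
        pass  # the process exists but belongs to another user
    return OK, "OK - ambari-agent running (pid %d)" % pid


def check_agent_heartbeat(last_heartbeat_ms, now_ms, max_age_s=120):
    """Option (b): return (exit_code, message) from a heartbeat timestamp.

    Both timestamps are epoch milliseconds. The caller is assumed to have
    obtained last_heartbeat_ms from the Ambari Server REST API.
    """
    age_s = (now_ms - last_heartbeat_ms) / 1000.0
    if age_s > max_age_s:
        return CRITICAL, "CRITICAL - last heartbeat %.0fs ago" % age_s
    return OK, "OK - last heartbeat %.0fs ago" % age_s
```

A plugin wrapper would print the returned message and exit with the returned code, which is how Nagios reads a check result.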
This alert should be shown on each Hosts > {host} page in Ambari Web.
Service Description: Ambari Agent (ambari-agent) process down
Service Group: AMBARI
Check / Retry Interval: 0.25
Note: a new service group, AMBARI, needs to be added to Nagios.