Details
-
Improvement
-
Status: Closed
-
Minor
-
Resolution: Fixed
-
None
-
None
Description
We've seen nodes fail to kill workers when they the processes end up in a defunct state. I would like a meter to track these failures for alerting.
Attachments
Issue Links
- links to