Resolution: Not A Problem
Affects Version/s: None
Fix Version/s: None
Say you have the following task config. Note that all processes have `max_failures = 1`.
Say we kill one of these thermos processes. The process gets restarted, since it technically did not crash or fail. Even if you kill it with `kill -SIGSEGV <pid>`, it comes back up and the number of failures stays at 0. The kill is registered as the process being lost instead, and that count correctly increases.
I think it makes sense to check the exit code when a process is killed and count it as a failure if the exit code is not `0`.
Note that if one of the processes fails or crashes, it is handled differently:
We can just check the `ProcessStatus.return_code` and act accordingly.
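To illustrate the idea, here is a minimal standalone sketch; it is not thermos code. It assumes the convention used by Python's `subprocess` module, where a child terminated by a signal reports a negative return code. Whether `ProcessStatus.return_code` uses the same encoding is not confirmed here, and `is_failure` and `run_and_kill` are hypothetical helper names.

```python
import signal
import subprocess
import sys


def is_failure(return_code):
    """Hypothetical helper: treat any non-zero exit status as a failure.

    Python's subprocess reports death-by-signal as a negative return
    code, so this check also covers processes that were killed.
    """
    return return_code != 0


def run_and_kill(sig):
    """Start a child that would sleep for a minute, signal it, return its status."""
    child = subprocess.Popen(
        [sys.executable, "-c", "import time; time.sleep(60)"]
    )
    child.send_signal(sig)
    return child.wait()


if __name__ == "__main__":
    code = run_and_kill(signal.SIGSEGV)
    # A killed process has a non-zero status, so it would count as a failure.
    print(code, is_failure(code))
```

With a check like this, a process killed by `kill -SIGSEGV <pid>` would count toward the failure limit instead of only being recorded as lost.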