Details
-
Improvement
-
Status: Open
-
Major
-
Resolution: Unresolved
-
2.6.0
-
None
-
None
Description
If we are launching yarn containers within cgroups, linux provides some guarantees that can help completely tear down a container. Specifically, linux guarantees that tasks can't escape a cgroup. We can use this fact to tear down a yarn container without leaking tasks.
Today, a SIGTERM is sent to the session (normally lead by bash). When the session leader exits, the LCE sees this and assumes all resources have been given back to the system. This is not guaranteed. Example: YARN-2809 implements a workaround that is only necessary because tasks are still lingering within the cgroup when the nodemanager attempts to delete it.