Details
-
Bug
-
Status: Open
-
Major
-
Resolution: Unresolved
-
2.9.0
-
None
-
None
-
None
Description
/cc vvasudev
We have been running some benchmarks internally with resource isolation enabled. We have consistently run into kernel panics when running a large job ( a large pi job, terasort ). These kernel panics wen't away when we set yarn.nodemanager.resource.percentage-physical-cpu-limit=100 . Anything less than 100 triggers different behavior in YARN's CPU resource handler which seems to cause these issues. Looking at the kernel crash dumps, the backtraces were different - sometimes pointing to java processes, sometimes not.
Kernel versions used : 3.10.0-229.14.1.el7.x86_64 and 3.10.0-327.13.1.el7.x86_64 .
Attachments
Issue Links
- relates to
-
YARN-4048 Linux kernel panic under strict CPU limits(on CentOS/RHEL 6.x)
- Open