Hadoop YARN / YARN-3678

DelayedProcessKiller may kill a process other than the container

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Critical
    • Resolution: Duplicate
    • Affects Version/s: 2.6.0, 2.7.2
    • Fix Version/s: None
    • Component/s: nodemanager
    • Labels:
      None

      Description

      Suppose a container has finished and cleanup begins. The PID file still exists and triggers one more signalContainer call, which kills the process with the pid recorded in the PID file. But since the container has already finished, that pid may by then be occupied by another process, which can cause serious issues.
      In my case the NM was killed unexpectedly, and what I described above can be the cause, even though it occurs rarely.
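
      A minimal sketch of the pattern being described, purely for illustration (this is not the actual NodeManager/container-executor code; the pid-file path and class name are invented):

      {code:java}
      import java.nio.charset.StandardCharsets;
      import java.nio.file.Files;
      import java.nio.file.Paths;

      // Simplified illustration of the race described above -- NOT the real NodeManager code.
      public class PidFileKillSketch {
        public static void main(String[] args) throws Exception {
          // Hypothetical pid file left behind by a container that has already exited.
          String pid = new String(
              Files.readAllBytes(Paths.get("/tmp/container_1234_0001_01_000002.pid")),
              StandardCharsets.UTF_8).trim();

          // SIGTERM, short grace period, then SIGKILL: the pattern discussed in this issue.
          new ProcessBuilder("kill", "-15", pid).inheritIO().start().waitFor();
          Thread.sleep(250);
          // Nothing verifies that 'pid' still belongs to the finished container.  If the kernel
          // has recycled the pid in the meantime, this SIGKILL hits an unrelated process
          // (possibly an NM thread, since Linux pids and tids share one number space).
          new ProcessBuilder("kill", "-9", pid).inheritIO().start().waitFor();
        }
      }
      {code}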


          Activity

          kirklg Kirk Leon Guerrero added a comment -

          I added 2.7.2 to the affected versions. My NM receives SIGKILL often enough that it's a problem, and most often SIGTERM. (non-secure mode)

          gu chi gu-chi added a comment -

          Same issue, as confirmed with Jun Gong.

          xuchenCN Xu Chen added a comment -

          LGTM +1

          gu chi gu-chi added a comment -

          I made this: https://github.com/apache/hadoop/pull/20/

          hadoopqa Hadoop QA added a comment -



          -1 overall

          Vote | Subsystem | Runtime | Comment
          -1   | patch     | 0m 0s   | The patch command could not apply the patch during dryrun.

          Subsystem      | Report/Notes
          Patch URL      | http://issues.apache.org/jira/secure/attachment/12735592/YARN-3678.patch
          Optional Tests |
          git revision   | trunk / bb18163
          Console output | https://builds.apache.org/job/PreCommit-YARN-Build/8099/console

          This message was automatically generated.

          zhiguohong Hong Zhiguo added a comment -

          The event sequence:
          call "SEND SIGTERM" -> pid recycle -> call "SEND SIGKILL" -> check process live time (based on current time)

          The time between [call "SEND SIGTERM"] and [call "SEND SIGKILL"] is 250ms.
          The time between [pid recycle] and [check process live time] may be shorter or longer than 250ms. When it's longer than 250ms, there's a chance we make a false-positive judgement.

          varun_saxena Varun Saxena added a comment -

          Yeah, that's why I said that if we can increase the value of pid_max on a 64-bit machine to the highest value it can take, i.e. 2^22, that should mitigate the risk of this happening. But anyway, as I mentioned above, we can still fix this regardless.
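
          As a point of reference, here is a minimal sketch of a deployment-side check (my own illustration, not part of any patch here); kernel.pid_max is an OS sysctl, and 2^22 = 4194304 is its upper bound on 64-bit Linux:

          {code:java}
          import java.nio.charset.StandardCharsets;
          import java.nio.file.Files;
          import java.nio.file.Paths;

          // Illustrative deployment check, not YARN code: warn when kernel.pid_max is far below
          // 2^22 (4194304), the maximum Linux allows on 64-bit kernels.
          public class PidMaxCheck {
            public static void main(String[] args) throws Exception {
              long pidMax = Long.parseLong(new String(
                  Files.readAllBytes(Paths.get("/proc/sys/kernel/pid_max")),
                  StandardCharsets.UTF_8).trim());
              if (pidMax < (1L << 22)) {
                System.err.println("kernel.pid_max is " + pidMax
                    + "; raising it toward 4194304 shrinks the pid-reuse window");
              }
            }
          }
          {code}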

          vvasudev Varun Vasudev added a comment -

          Hong Zhiguo - sorry, my question was: after applying your fix, the problem should have gone away. However, you said "With this fix, the 'accident' rate is reduced from several times per day to nearly zero." Do you know why it still happened?

          zhiguohong Hong Zhiguo added a comment -

          First, "stop container" happens frequently.
          Second, the pid recycle doesn't need to complete a whole round within the 250ms; it only needs to complete one or more rounds during the container's lifetime.

          If we have 100 "stop container" events on one node per day, we have about 100/32768, roughly a 0.3% chance per node per day. That's not very low, especially when we have 5000 nodes.
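
          For scale (my arithmetic, extending the estimate above): at roughly 0.3% per node per day, a 5000-node cluster would expect about 0.003 × 5000 ≈ 15 such collisions per day, which is consistent with the "several times per day" rate reported elsewhere in this thread.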

          Naganarasimha Naganarasimha G R added a comment -

          Hi Varun Vasudev & Hong Zhiguo, for us it happened in a secure setup, and one key point is that the NM user and the user of the container are the same. But irrespective of this, it could have killed any other process [container] for the same/another app running on the same node, submitted by the same user. One suggestion (a crude fix, not sure how to get it working for other OSes): can we grep for the containerID to confirm it's the same process we are targeting, and then kill it?
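
          A rough sketch of that suggestion (Linux-only, and assuming the container ID appears on the target's command line; the class and method names are invented for illustration):

          {code:java}
          import java.nio.charset.StandardCharsets;
          import java.nio.file.Files;
          import java.nio.file.Paths;

          // Rough sketch of the suggestion above: before sending the kill, confirm that the pid's
          // command line still mentions the expected container ID.  Linux-only; purely illustrative.
          public class ContainerIdCheckSketch {
            static boolean pidStillBelongsToContainer(String pid, String containerId) {
              try {
                // /proc/<pid>/cmdline is NUL-separated; a plain substring check is enough here.
                byte[] raw = Files.readAllBytes(Paths.get("/proc", pid, "cmdline"));
                String cmdline = new String(raw, StandardCharsets.UTF_8).replace('\0', ' ');
                return cmdline.contains(containerId);
              } catch (Exception e) {
                // Process already gone (or unreadable): treat it as "not ours" and skip the kill.
                return false;
              }
            }
          }
          {code}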

          varun_saxena Varun Saxena added a comment -

          As Hong Zhiguo mentioned, in our case too the same user is used for the NM and the app-submitter.

          varun_saxena Varun Saxena added a comment -

          Secure.

          varun_saxena Varun Saxena added a comment -

          I think if we increase the value of pid_max, the issue is unlikely to occur.

          vvasudev Varun Vasudev added a comment -

          Hong Zhiguo thanks for the detailed explanation! When you say your fix reduced the rate to nearly zero, do you know why the accidental kill continued to happen?

          zhiguohong Hong Zhiguo added a comment -

          We met the same issue on our production cluster last year. The same user is used for the NM and some app-submitters.
          I hooked the kernel __send_signal function via kprobe (https://github.com/honkiko/signal-monitor) and confirmed what happens:

          • container-executor sends SIGTERM to a container (say, pid = X)
          • The container exits quickly (within 250ms)
          • pid X is recycled and taken by a newly spawned thread of the NM
          • After 250ms, container-executor sends SIGKILL to pid X
          • The NM is killed

          I added a check of the living time before container-executor sends SIGKILL: if the process has been alive for less than 250ms, it's not the target process we sent SIGTERM to, and we just skip it.

          With this fix, the "accident" rate is reduced from several times per day to nearly zero.
          If you think such a fix is acceptable, I'll post it here.
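
          The change described here lives in container-executor (which is C); purely to illustrate the same idea, a sketch of the check in Java might look like the following (Linux /proc only, with the clock-tick rate assumed to be the usual 100 and all names invented):

          {code:java}
          import java.nio.charset.StandardCharsets;
          import java.nio.file.Files;
          import java.nio.file.Paths;

          // Illustration of the living-time check described above; the real fix belongs in
          // container-executor (C).  Linux-only, and assumes the usual 100 clock ticks per second.
          public class ProcessAgeSketch {
            private static final long CLK_TCK = 100; // assumption: sysconf(_SC_CLK_TCK) on most Linux

            /** Milliseconds the process has been alive, or -1 if it no longer exists. */
            static long processAgeMillis(String pid) {
              try {
                String stat = new String(
                    Files.readAllBytes(Paths.get("/proc", pid, "stat")), StandardCharsets.UTF_8);
                // Field 22 of /proc/<pid>/stat is starttime, in clock ticks since boot.  The comm
                // field may contain spaces, so split only after the closing ')'.
                String[] fields = stat.substring(stat.lastIndexOf(')') + 2).split(" ");
                long startTicks = Long.parseLong(fields[19]); // field 22 overall = index 19 here
                double uptimeSec = Double.parseDouble(new String(
                    Files.readAllBytes(Paths.get("/proc/uptime")), StandardCharsets.UTF_8).split(" ")[0]);
                return (long) ((uptimeSec - (double) startTicks / CLK_TCK) * 1000);
              } catch (Exception e) {
                return -1;
              }
            }

            /** Only SIGKILL a pid whose owner already existed when SIGTERM was sent. */
            static boolean safeToSigkill(String pid, long millisSinceSigterm) {
              long age = processAgeMillis(pid);
              return age >= millisSinceSigterm; // a younger process cannot be the one we signalled
            }
          }
          {code}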

          vvasudev Varun Vasudev added a comment -

          Is this in secure or non-secure mode?

          varun_saxena Varun Saxena added a comment -

          Vinod Kumar Vavilapalli, I suspect it's a single user for everything. gu-chi can confirm, though.

          vinodkv Vinod Kumar Vavilapalli added a comment -

          Tx for the update Varun Saxena. You mentioned LCE. But like I said before, LCE kills containers as the app-submitter. So, in your case, what is the user running the containers?

          varun_saxena Varun Saxena added a comment -

          Vinod Kumar Vavilapalli, as this issue happened in one of our customer deployments, I will explain it. We hit an issue where the NM was being randomly killed at one of the sites where our Hadoop distribution is deployed. In the logs, we could see the NM being killed immediately after signalContainer is called. What happens is as follows:

          1. LCE sends a SIGTERM to the container and waits for 250 ms.
          2. Probably within this 250 ms period, the container processes the signal and exits gracefully.
          3. Now it is possible that the pid assigned to the container is taken up by some other process or thread (which run as lightweight processes in Linux).
          4. When LCE then tries to send a SIGKILL to the same pid, it might actually be sending it to another process or thread.
          5. As we could not find any other reason for the NM randomly going down, we suspect some new NM thread took up this pid and the SIGKILL was sent to it, crashing the NM. This is based on suspicion rather than fool-proof analysis, though; not sure how to verify whether this is indeed what happened.

          Please note that pid_max in the deployment was 32768.
          I am not sure which user was the process owner, though. Probably gu-chi can shed some light on that.
          An additional check can be done, IMHO.

          gu chi gu-chi added a comment -

          The PID number may not be in use by a process; it can also be a thread. Linux treats processes and threads the same way for signals, so killing one thread of a process may kill the whole process too. And for a thread, starting up within 250ms is possible, right?

          gu chi gu-chi added a comment -

          I see the probability is low, but under heavy task load it occurs frequently. I would suggest adding a check before the kill: check whether the process ID still belongs to the container.

          vinodkv Vinod Kumar Vavilapalli added a comment -

          The default delay is 250 milliseconds. So it is very hard to hit this condition.

          At least when LinuxContainerExecutor is used, the kill is done as the user itself, so it's unlikely it will affect other users' processes.

          Other than also doing a user-check to ensure it's the same user's container, I am not sure what else can be done.

          gu chi gu-chi added a comment -

          I think decreasing the pid_max setting in the OS can increase the probability of reproducing this; working on it.


            People

            • Assignee: Unassigned
            • Reporter: gu chi gu-chi
            • Votes: 0
            • Watchers: 19
