Details
-
Bug
-
Status: Resolved
-
Critical
-
Resolution: Implemented
-
2.8.1
-
None
-
CentOS
Description
In some servers, When I use Docker on LinuxContainer, I found the aciton that AppMaster register to Resourcemanager failed. But didn't happen in other servers.
I found the pclose (in container-executor.c) return different value in different server, even though the process which is launched by popen is running normally. Some server return 0, and others return 13.
Because yarn regard the application as failed application when pclose return nonzero, and yarn will remove the AMRMToken, then the AppMaster register failed because Resourcemanager have removed this applicaiton's token.
In container-executor.c, the judgement condition is whether the return code is zero. But man the pclose, the document tells that "pclose return -1" represent wrong. So I change the judgement condition, then slove this problem.
Attachments
Attachments
Issue Links
- Is contained by
-
YARN-7654 Support ENTRY_POINT for docker container
- Resolved