Details
-
Sub-task
-
Status: Resolved
-
Major
-
Resolution: Abandoned
-
None
-
None
-
None
-
None
Description
Docker image with latest tag can run in YARN cluster without any validation in node managers. If a image with latest tag is changed during containers launch. It might produce inconsistent results between nodes. This is surfaced toward end of development for YARN-9184 to keep docker image consistent within a job. One of the ideas to keep :latest tag consistent for a job, is to use docker image command to figure out the image id and use image id to propagate to rest of the container requests. There are some challenges to overcome:
- The latest tag does not exist on the node where first container starts. The first container will need to download the latest image, and find image ID. This can introduce lag time for other containers to start.
- If image id is used to start other container, container-executor may have problems to check if the image is coming from a trusted source. Both image name and ID must be supply through .cmd file to container-executor. However, hacker can supply incorrect image id and defeat container-executor security checks.
If we can over come those challenges, it maybe possible to keep docker image consistent with one application.