I have a server with two GPU, and i want to use only one of them within yarn cluster.
according to hadoop document, i set configs:
then i running following command to test:
iI expected gpu with minor number 0 will not visible to container, but in the launched container, nvidia-smi print two gpu information.
I check the related source code and find it is a bug.
the problem is:
when you specify allowed-gpu-devices, GpuDiscoverer will populate usable gpus from it,
then when assign to a container some of the gpus, it will set denied gpus for the container,
but it never consider excluded gpu of the host.