Details
-
Improvement
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
-
None
-
None
Description
we had a gpu cluster, jobs with bigger resource request couldn't be satisfied for node is running the jobs with smaller resource request. we didn't open reserve system because gpu jobs may run days or weeks. we expect scheduler allocate containers to fill the node , then there will be resource to run jobs with big resource request.