[CLOUDSTACK-10106] GPU/vGPU Support on VMWare - ASF JIRA

ASF GitHub Bot added a comment - 18/Sep/18 10:34

rhtyd commented on issue #2340: CLOUDSTACK-10106: GPU/vGPU Support on VMware
URL: https://github.com/apache/cloudstack/pull/2340#issuecomment-422342539

Test LGTM, but I think the code could be further refactored. I'm unable to find time to do that @nitin-maharana but please see parts of the code where you see several ifs-elses, see how you can improve that and also add more tests?

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org

ASF GitHub Bot added a comment - 18/Sep/18 10:34 rhtyd commented on issue #2340: CLOUDSTACK-10106 : GPU/vGPU Support on VMware URL: https://github.com/apache/cloudstack/pull/2340#issuecomment-422342539 Test LGTM, but I think the code could be further refactored. I'm unable to find time to do that @nitin-maharana but please see parts of the code where you see several ifs-elses, see how you can improve that and also add more tests? ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org

ASF GitHub Bot added a comment - 07/Sep/18 21:28

blueorangutan commented on issue #2340: CLOUDSTACK-10106: GPU/vGPU Support on VMware
URL: https://github.com/apache/cloudstack/pull/2340#issuecomment-419570623

<b>Trillian test result (tid-2988)</b>
Environment: vmware-65 (x2), Advanced Networking with Mgmt server 7
Total time taken: 31660 seconds
Marvin logs: https://github.com/blueorangutan/acs-prs/releases/download/trillian/pr2340-t2988-vmware-65.zip
Intermittent failure detected: /marvin/tests/smoke/test_deploy_virtio_scsi_vm.py
Intermittent failure detected: /marvin/tests/smoke/test_public_ip_range.py
Intermittent failure detected: /marvin/tests/smoke/test_templates.py
Intermittent failure detected: /marvin/tests/smoke/test_usage.py
Intermittent failure detected: /marvin/tests/smoke/test_volumes.py
Smoke tests completed. 65 look OK, 4 have error(s)
Only failed tests results shown below:

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org

ASF GitHub Bot added a comment - 07/Sep/18 21:28 blueorangutan commented on issue #2340: CLOUDSTACK-10106 : GPU/vGPU Support on VMware URL: https://github.com/apache/cloudstack/pull/2340#issuecomment-419570623 <b>Trillian test result (tid-2988)</b> Environment: vmware-65 (x2), Advanced Networking with Mgmt server 7 Total time taken: 31660 seconds Marvin logs: https://github.com/blueorangutan/acs-prs/releases/download/trillian/pr2340-t2988-vmware-65.zip Intermittent failure detected: /marvin/tests/smoke/test_deploy_virtio_scsi_vm.py Intermittent failure detected: /marvin/tests/smoke/test_public_ip_range.py Intermittent failure detected: /marvin/tests/smoke/test_templates.py Intermittent failure detected: /marvin/tests/smoke/test_usage.py Intermittent failure detected: /marvin/tests/smoke/test_volumes.py Smoke tests completed. 65 look OK, 4 have error(s) Only failed tests results shown below: Test | Result | Time (s) | Test File — | — | — | — ContextSuite context=TestDeployVirtioSCSIVM>:teardown | `Error` | 0.00 | test_deploy_virtio_scsi_vm.py test_04_extract_template | `Failure` | 138.61 | test_templates.py ContextSuite context=TestISOUsage>:setup | `Error` | 0.00 | test_usage.py test_06_download_detached_volume | `Failure` | 162.07 | test_volumes.py ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org

ASF GitHub Bot added a comment - 07/Sep/18 12:13

blueorangutan commented on issue #2340: CLOUDSTACK-10106: GPU/vGPU Support on VMware
URL: https://github.com/apache/cloudstack/pull/2340#issuecomment-419420642

@rhtyd a Trillian-Jenkins test job (centos7 mgmt + vmware-65) has been kicked to run smoke tests

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org

ASF GitHub Bot added a comment - 07/Sep/18 12:13 blueorangutan commented on issue #2340: CLOUDSTACK-10106 : GPU/vGPU Support on VMware URL: https://github.com/apache/cloudstack/pull/2340#issuecomment-419420642 @rhtyd a Trillian-Jenkins test job (centos7 mgmt + vmware-65) has been kicked to run smoke tests ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org

ASF GitHub Bot added a comment - 07/Sep/18 12:13

rhtyd commented on issue #2340: CLOUDSTACK-10106: GPU/vGPU Support on VMware
URL: https://github.com/apache/cloudstack/pull/2340#issuecomment-419420575

@blueorangutan test centos7 vmware-65

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org

ASF GitHub Bot added a comment - 07/Sep/18 12:13 rhtyd commented on issue #2340: CLOUDSTACK-10106 : GPU/vGPU Support on VMware URL: https://github.com/apache/cloudstack/pull/2340#issuecomment-419420575 @blueorangutan test centos7 vmware-65 ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org

ASF GitHub Bot added a comment - 07/Sep/18 12:04

blueorangutan commented on issue #2340: CLOUDSTACK-10106: GPU/vGPU Support on VMware
URL: https://github.com/apache/cloudstack/pull/2340#issuecomment-419418616

Packaging result: ✔centos6 ✔centos7 ✔debian. JID-2288

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org

ASF GitHub Bot added a comment - 07/Sep/18 12:04 blueorangutan commented on issue #2340: CLOUDSTACK-10106 : GPU/vGPU Support on VMware URL: https://github.com/apache/cloudstack/pull/2340#issuecomment-419418616 Packaging result: ✔centos6 ✔centos7 ✔debian. JID-2288 ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org

ASF GitHub Bot added a comment - 07/Sep/18 11:37

blueorangutan commented on issue #2340: CLOUDSTACK-10106: GPU/vGPU Support on VMware
URL: https://github.com/apache/cloudstack/pull/2340#issuecomment-419412954

@rhtyd a Jenkins job has been kicked to build packages. I'll keep you posted as I make progress.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org

ASF GitHub Bot added a comment - 07/Sep/18 11:37 blueorangutan commented on issue #2340: CLOUDSTACK-10106 : GPU/vGPU Support on VMware URL: https://github.com/apache/cloudstack/pull/2340#issuecomment-419412954 @rhtyd a Jenkins job has been kicked to build packages. I'll keep you posted as I make progress. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org

ASF GitHub Bot added a comment - 07/Sep/18 11:36

rhtyd commented on issue #2340: CLOUDSTACK-10106: GPU/vGPU Support on VMware
URL: https://github.com/apache/cloudstack/pull/2340#issuecomment-419412763

@blueorangutan package

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org

ASF GitHub Bot added a comment - 07/Sep/18 11:36 rhtyd commented on issue #2340: CLOUDSTACK-10106 : GPU/vGPU Support on VMware URL: https://github.com/apache/cloudstack/pull/2340#issuecomment-419412763 @blueorangutan package ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org

ASF GitHub Bot added a comment - 21/Aug/18 11:14

nitin-maharana commented on issue #2340: CLOUDSTACK-10106: GPU/vGPU Support on VMware
URL: https://github.com/apache/cloudstack/pull/2340#issuecomment-414639173

@rhtyd, I am sorry, somehow I missed your comment. Rebased against latest master. Thank You!

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org

ASF GitHub Bot added a comment - 21/Aug/18 11:14 nitin-maharana commented on issue #2340: CLOUDSTACK-10106 : GPU/vGPU Support on VMware URL: https://github.com/apache/cloudstack/pull/2340#issuecomment-414639173 @rhtyd, I am sorry, somehow I missed your comment. Rebased against latest master. Thank You! ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org

ASF GitHub Bot added a comment - 14/Aug/18 08:16

rhtyd commented on issue #2340: CLOUDSTACK-10106: GPU/vGPU Support on VMware
URL: https://github.com/apache/cloudstack/pull/2340#issuecomment-412791998

@nitin-maharana can you rebase against latest master, thanks.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org

ASF GitHub Bot added a comment - 14/Aug/18 08:16 rhtyd commented on issue #2340: CLOUDSTACK-10106 : GPU/vGPU Support on VMware URL: https://github.com/apache/cloudstack/pull/2340#issuecomment-412791998 @nitin-maharana can you rebase against latest master, thanks. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org

ASF GitHub Bot added a comment - 08/Aug/18 06:49

rhtyd commented on issue #2340: CLOUDSTACK-10106: GPU/vGPU Support on VMware
URL: https://github.com/apache/cloudstack/pull/2340#issuecomment-411304525

@nitin-maharana can you rebase against latest master, thanks.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org

ASF GitHub Bot added a comment - 08/Aug/18 06:49 rhtyd commented on issue #2340: CLOUDSTACK-10106 : GPU/vGPU Support on VMware URL: https://github.com/apache/cloudstack/pull/2340#issuecomment-411304525 @nitin-maharana can you rebase against latest master, thanks. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org

ASF GitHub Bot added a comment - 28/Dec/17 09:36

rhtyd commented on a change in pull request #2340: CLOUDSTACK-10106: GPU/vGPU Support on VMware
URL: https://github.com/apache/cloudstack/pull/2340#discussion_r158919029

##########
File path: vmware-base/src/com/cloud/hypervisor/vmware/mo/HostMO.java
##########
@@ -1184,4 +1206,261 @@ public ManagedObjectReference waitForPortGroup(String networkName, long timeOutM
}
return morNetwork;
}
+
+ public ManagedObjectReference getComputeResourceEnvironmentBrowser() throws Exception

{ + ManagedObjectReference morParent = getParentMor(); + ClusterMO clusterMo = new ClusterMO(_context, morParent); + return clusterMo.getComputeResourceEnvironmentBrowser(); + }

+
+ public VirtualMachinePciPassthroughInfo getHostPciDeviceInfo(final String pciDeviceId) throws Exception {
+ VirtualMachinePciPassthroughInfo matchingPciPassthroughDevice = null;
+ ConfigTarget configTarget = _context.getService().queryConfigTarget(getComputeResourceEnvironmentBrowser(), _mor);
+ List<VirtualMachinePciPassthroughInfo> pciPassthroughDevices = configTarget.getPciPassthrough();
+ for (VirtualMachinePciPassthroughInfo pciPassthroughDevice : pciPassthroughDevices) {
+ HostPciDevice hostPciDevice = pciPassthroughDevice.getPciDevice();
+ if (pciDeviceId.equals(hostPciDevice.getId()))

{ + matchingPciPassthroughDevice = pciPassthroughDevice; + break; + }

+ }
+ return matchingPciPassthroughDevice;
+ }
+
+ public VirtualDevice prepareSharedPciPassthroughDevice(final String vGpuProfile)

{ + s_logger.debug("Preparing shared PCI device"); + VirtualPCIPassthrough virtualPciPassthrough = new VirtualPCIPassthrough(); + VirtualPCIPassthroughVmiopBackingInfo virtualPCIPassthroughVmiopBackingInfo = new VirtualPCIPassthroughVmiopBackingInfo(); + virtualPCIPassthroughVmiopBackingInfo.setVgpu(vGpuProfile); + virtualPciPassthrough.setBacking(virtualPCIPassthroughVmiopBackingInfo); + Description description = new Description(); + description.setLabel("vGPU device"); + description.setSummary("vGPU type: " + vGpuProfile); + virtualPciPassthrough.setDeviceInfo(description); + return virtualPciPassthrough; + }

+
+ public VirtualDevice prepareDirectPciPassthroughDevice(final VirtualMachinePciPassthroughInfo hostPciDeviceInfo)

{ + // Ex: pciDeviceId is like "0000:08:00.0" composed of bus,slot,function + s_logger.debug("Preparing direct PCI device"); + + VirtualPCIPassthrough pciDevice = new VirtualPCIPassthrough(); + VirtualPCIPassthroughDeviceBackingInfo pciBacking = new VirtualPCIPassthroughDeviceBackingInfo(); + pciBacking.setId(hostPciDeviceInfo.getPciDevice().getId()); + pciBacking.setDeviceId(Integer.toHexString(hostPciDeviceInfo.getPciDevice().getDeviceId())); + pciBacking.setDeviceName(hostPciDeviceInfo.getPciDevice().getDeviceName()); + pciBacking.setVendorId(hostPciDeviceInfo.getPciDevice().getVendorId()); + pciBacking.setSystemId(hostPciDeviceInfo.getSystemId()); + pciDevice.setBacking(pciBacking); + return pciDevice; + }

+
+ public String getPciIdForAvailableDirectPciPassthroughDevice() throws Exception {
+ String pciId = "";
+ List<HostGraphicsInfo> hostGraphicsInfos = getHostGraphicsInfo();
+ for (HostGraphicsInfo hostGraphicsInfo : hostGraphicsInfos) {
+ if (GPU.GPUType.direct.toString().equalsIgnoreCase(hostGraphicsInfo.getGraphicsType())) {
+ List<ManagedObjectReference> vms = hostGraphicsInfo.getVm();
+ if (CollectionUtils.isEmpty(vms))

{ + pciId = hostGraphicsInfo.getPciId(); + break; + }

+ }
+ }
+ return pciId;
+ }
+
+ /**
+ * It updates the info of each vGPU type in the NVidia GRID K1/K2 Card.
+ * @param gpuCapacity (The output is stored in this)
+ * @param groupName - (NVIDIAGRID K1 or NVIDIAGRID K2)
+ * @param countGridKSharedGPUs (Number of Enabled shared GPUs)
+ * @param graphicsInfo (Info regarding the card)
+ * @param sharedPassthruGpuTypes (All the enabled vGPU types)
+ * @param gridKGPUMemory (Video RAM of each GPU in the card)
+ * @throws Exception
+ */
+ private void updateGpuCapacities(final HashMap<String, VgpuTypesInfo> gpuCapacity, final String groupName, final long countGridKSharedGPUs, final List<HostGraphicsInfo> graphicsInfo, final List<String> sharedPassthruGpuTypes, final long gridKGPUMemory) throws Exception {
+ /*
+ * 0 - grid_k100 or grid_k200
+ * 1 - grid_k120q or grid_k220q
+ * 2 - grid_k140q or grid_k240q
+ * 3 - grid_k160q or grid_k260q
+ * 4 - grid_k180q or grid_k280q
+ */
+ final long remainingCapacities[] = new long[5];
+
+ remainingCapacities[0] = 8l * countGridKSharedGPUs;
+ remainingCapacities[1] = 8l * countGridKSharedGPUs;
+ remainingCapacities[2] = 4l * countGridKSharedGPUs;
+ remainingCapacities[3] = 2l * countGridKSharedGPUs;
+ remainingCapacities[4] = countGridKSharedGPUs;
+
+ for (final HostGraphicsInfo graphicInfo : graphicsInfo) {
+ if (graphicInfo.getDeviceName().equals(groupName) && graphicInfo.getGraphicsType().equals("shared")) {
+ if (CollectionUtils.isNotEmpty(graphicInfo.getVm())) {
+ String vgpuType = "None";
+
+ for (ManagedObjectReference mor : graphicInfo.getVm()) {
+ final VirtualMachineMO vmMo = new VirtualMachineMO(_context, mor);
+
+ if (vgpuType.equals("None") && vmMo != null && vmMo.getConfigInfo() != null && vmMo.getConfigInfo().getHardware() != null) {
+ final List<VirtualDevice> devices = vmMo.getConfigInfo().getHardware().getDevice();
+
+ for (VirtualDevice device : devices) {
+ if (device instanceof VirtualPCIPassthrough) {
+ if (device.getBacking() != null && (device.getBacking() instanceof VirtualPCIPassthroughVmiopBackingInfo)) {
+ final VirtualPCIPassthroughVmiopBackingInfo backingInfo = (VirtualPCIPassthroughVmiopBackingInfo) device.getBacking();
+
+ if (backingInfo.getVgpu() != null)

{ + vgpuType = backingInfo.getVgpu(); + break; + }

+ }
+ }
+ }
+ }
+ }
+
+ // If GRID K1, then search for only K1 vGPU types. Same for GRID K2.
+ // The remaining capacity of one type affects other vGPU type capacity.
+ // Each GPU should always contain one type of vGPU VMs.
+ if ((groupName.equals("NVIDIAGRID K1") && vgpuType.equals("grid_k100")) || (groupName.equals("NVIDIAGRID K2") && vgpuType.equals("grid_k200"))) {

Review comment:
@nitin-maharana my suggestion is around refactoring, all the ifs specific to a graphics card type/name could be wrapped to a class where the behavior settings/values could be moved. It will make it easier in future to add more graphics cards, maintain code etc.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org

ASF GitHub Bot added a comment - 28/Dec/17 09:36 rhtyd commented on a change in pull request #2340: CLOUDSTACK-10106 : GPU/vGPU Support on VMware URL: https://github.com/apache/cloudstack/pull/2340#discussion_r158919029 ########## File path: vmware-base/src/com/cloud/hypervisor/vmware/mo/HostMO.java ########## @@ -1184,4 +1206,261 @@ public ManagedObjectReference waitForPortGroup(String networkName, long timeOutM } return morNetwork; } + + public ManagedObjectReference getComputeResourceEnvironmentBrowser() throws Exception { + ManagedObjectReference morParent = getParentMor(); + ClusterMO clusterMo = new ClusterMO(_context, morParent); + return clusterMo.getComputeResourceEnvironmentBrowser(); + } + + public VirtualMachinePciPassthroughInfo getHostPciDeviceInfo(final String pciDeviceId) throws Exception { + VirtualMachinePciPassthroughInfo matchingPciPassthroughDevice = null; + ConfigTarget configTarget = _context.getService().queryConfigTarget(getComputeResourceEnvironmentBrowser(), _mor); + List<VirtualMachinePciPassthroughInfo> pciPassthroughDevices = configTarget.getPciPassthrough(); + for (VirtualMachinePciPassthroughInfo pciPassthroughDevice : pciPassthroughDevices) { + HostPciDevice hostPciDevice = pciPassthroughDevice.getPciDevice(); + if (pciDeviceId.equals(hostPciDevice.getId())) { + matchingPciPassthroughDevice = pciPassthroughDevice; + break; + } + } + return matchingPciPassthroughDevice; + } + + public VirtualDevice prepareSharedPciPassthroughDevice(final String vGpuProfile) { + s_logger.debug("Preparing shared PCI device"); + VirtualPCIPassthrough virtualPciPassthrough = new VirtualPCIPassthrough(); + VirtualPCIPassthroughVmiopBackingInfo virtualPCIPassthroughVmiopBackingInfo = new VirtualPCIPassthroughVmiopBackingInfo(); + virtualPCIPassthroughVmiopBackingInfo.setVgpu(vGpuProfile); + virtualPciPassthrough.setBacking(virtualPCIPassthroughVmiopBackingInfo); + Description description = new Description(); + description.setLabel("vGPU device"); + description.setSummary("vGPU type: " + vGpuProfile); + virtualPciPassthrough.setDeviceInfo(description); + return virtualPciPassthrough; + } + + public VirtualDevice prepareDirectPciPassthroughDevice(final VirtualMachinePciPassthroughInfo hostPciDeviceInfo) { + // Ex: pciDeviceId is like "0000:08:00.0" composed of bus,slot,function + s_logger.debug("Preparing direct PCI device"); + + VirtualPCIPassthrough pciDevice = new VirtualPCIPassthrough(); + VirtualPCIPassthroughDeviceBackingInfo pciBacking = new VirtualPCIPassthroughDeviceBackingInfo(); + pciBacking.setId(hostPciDeviceInfo.getPciDevice().getId()); + pciBacking.setDeviceId(Integer.toHexString(hostPciDeviceInfo.getPciDevice().getDeviceId())); + pciBacking.setDeviceName(hostPciDeviceInfo.getPciDevice().getDeviceName()); + pciBacking.setVendorId(hostPciDeviceInfo.getPciDevice().getVendorId()); + pciBacking.setSystemId(hostPciDeviceInfo.getSystemId()); + pciDevice.setBacking(pciBacking); + return pciDevice; + } + + public String getPciIdForAvailableDirectPciPassthroughDevice() throws Exception { + String pciId = ""; + List<HostGraphicsInfo> hostGraphicsInfos = getHostGraphicsInfo(); + for (HostGraphicsInfo hostGraphicsInfo : hostGraphicsInfos) { + if (GPU.GPUType.direct.toString().equalsIgnoreCase(hostGraphicsInfo.getGraphicsType())) { + List<ManagedObjectReference> vms = hostGraphicsInfo.getVm(); + if (CollectionUtils.isEmpty(vms)) { + pciId = hostGraphicsInfo.getPciId(); + break; + } + } + } + return pciId; + } + + /** + * It updates the info of each vGPU type in the NVidia GRID K1/K2 Card. + * @param gpuCapacity (The output is stored in this) + * @param groupName - (NVIDIAGRID K1 or NVIDIAGRID K2) + * @param countGridKSharedGPUs (Number of Enabled shared GPUs) + * @param graphicsInfo (Info regarding the card) + * @param sharedPassthruGpuTypes (All the enabled vGPU types) + * @param gridKGPUMemory (Video RAM of each GPU in the card) + * @throws Exception + */ + private void updateGpuCapacities(final HashMap<String, VgpuTypesInfo> gpuCapacity, final String groupName, final long countGridKSharedGPUs, final List<HostGraphicsInfo> graphicsInfo, final List<String> sharedPassthruGpuTypes, final long gridKGPUMemory) throws Exception { + /* + * 0 - grid_k100 or grid_k200 + * 1 - grid_k120q or grid_k220q + * 2 - grid_k140q or grid_k240q + * 3 - grid_k160q or grid_k260q + * 4 - grid_k180q or grid_k280q + */ + final long remainingCapacities[] = new long [5] ; + + remainingCapacities [0] = 8l * countGridKSharedGPUs; + remainingCapacities [1] = 8l * countGridKSharedGPUs; + remainingCapacities [2] = 4l * countGridKSharedGPUs; + remainingCapacities [3] = 2l * countGridKSharedGPUs; + remainingCapacities [4] = countGridKSharedGPUs; + + for (final HostGraphicsInfo graphicInfo : graphicsInfo) { + if (graphicInfo.getDeviceName().equals(groupName) && graphicInfo.getGraphicsType().equals("shared")) { + if (CollectionUtils.isNotEmpty(graphicInfo.getVm())) { + String vgpuType = "None"; + + for (ManagedObjectReference mor : graphicInfo.getVm()) { + final VirtualMachineMO vmMo = new VirtualMachineMO(_context, mor); + + if (vgpuType.equals("None") && vmMo != null && vmMo.getConfigInfo() != null && vmMo.getConfigInfo().getHardware() != null) { + final List<VirtualDevice> devices = vmMo.getConfigInfo().getHardware().getDevice(); + + for (VirtualDevice device : devices) { + if (device instanceof VirtualPCIPassthrough) { + if (device.getBacking() != null && (device.getBacking() instanceof VirtualPCIPassthroughVmiopBackingInfo)) { + final VirtualPCIPassthroughVmiopBackingInfo backingInfo = (VirtualPCIPassthroughVmiopBackingInfo) device.getBacking(); + + if (backingInfo.getVgpu() != null) { + vgpuType = backingInfo.getVgpu(); + break; + } + } + } + } + } + } + + // If GRID K1, then search for only K1 vGPU types. Same for GRID K2. + // The remaining capacity of one type affects other vGPU type capacity. + // Each GPU should always contain one type of vGPU VMs. + if ((groupName.equals("NVIDIAGRID K1") && vgpuType.equals("grid_k100")) || (groupName.equals("NVIDIAGRID K2") && vgpuType.equals("grid_k200"))) { Review comment: @nitin-maharana my suggestion is around refactoring, all the ifs specific to a graphics card type/name could be wrapped to a class where the behavior settings/values could be moved. It will make it easier in future to add more graphics cards, maintain code etc. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org

ASF GitHub Bot added a comment - 27/Dec/17 11:26

nitin-maharana commented on a change in pull request #2340: CLOUDSTACK-10106: GPU/vGPU Support on VMware
URL: https://github.com/apache/cloudstack/pull/2340#discussion_r158800362