Details
-
Sub-task
-
Status: Closed
-
Major
-
Resolution: Fixed
-
None
-
None
-
None
Description
1. Add an new MMTSJGPUInstruction instruction under package org.apache.sysml.runtime.instructions.gpu.
2. Add appropriate hooks at runtime parser. For example:
String2GPUInstructionType.put( "tsmm" , CPINSTRUCTION_TYPE.MMTSJ);
3. Add appropriate hooks at Hops/Lops.
4. Add a new function in org.apache.sysml.runtime.matrix.data.LibMatrixCUDA library to perform tsmm: transposeSelfMatrixMultOperations