Details
Type: Sub-task
Status: Open
Priority: Major
Resolution: Unresolved
Description
For LU decomposition, see JCublas2's cublasDgetrfBatched method.
For QR decomposition, see JCublas2's cublasDgeqrfBatched method.
The key changes required:
1. Add a GPU backend in https://github.com/apache/incubator-systemml/blob/master/src/main/java/org/apache/sysml/hops/FunctionOp.java#L239
2. Add a MultiReturnBuiltinGPUInstruction that invokes the above functions, either directly or through LibMatrixCUDA.
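For validating the GPU LU path, a small CPU reference can help. The following is a minimal sketch, not part of SystemML or JCublas2: the class name LuReference and its methods are hypothetical. It factors one column-major matrix in place with partial pivoting so that P*A = L*U, which is the factorization the cuBLAS getrfBatched routines compute per matrix. The sketch records 0-based pivot rows for convenience; the pivot index convention of the GPU routine is an assumption to check against the cuBLAS documentation before comparing pivots directly.

```java
/** CPU reference for an in-place LU factorization with partial pivoting (P*A = L*U). */
final class LuReference {

    /**
     * Factors the n x n matrix stored column-major in a, overwriting it with
     * L (strictly lower part, unit diagonal implied) and U (upper part).
     * Returns 0-based pivots: at step k, row k was swapped with row piv[k].
     */
    static int[] luInPlace(double[] a, int n) {
        int[] piv = new int[n];
        for (int k = 0; k < n; k++) {
            // Pick the row with the largest magnitude in column k.
            int p = k;
            for (int i = k + 1; i < n; i++) {
                if (Math.abs(a[i + k * n]) > Math.abs(a[p + k * n])) {
                    p = i;
                }
            }
            piv[k] = p;
            if (a[p + k * n] == 0.0) {
                throw new ArithmeticException("singular matrix at column " + k);
            }
            // Swap rows k and p across all columns.
            if (p != k) {
                for (int j = 0; j < n; j++) {
                    double t = a[k + j * n];
                    a[k + j * n] = a[p + j * n];
                    a[p + j * n] = t;
                }
            }
            // Store multipliers below the diagonal, then update the trailing block.
            for (int i = k + 1; i < n; i++) {
                a[i + k * n] /= a[k + k * n];
                for (int j = k + 1; j < n; j++) {
                    a[i + j * n] -= a[i + k * n] * a[k + j * n];
                }
            }
        }
        return piv;
    }

    /** Computes L*U (column-major) from the packed factors. */
    static double[] multiplyLU(double[] lu, int n) {
        double[] out = new double[n * n];
        for (int i = 0; i < n; i++) {
            for (int j = 0; j < n; j++) {
                double s = 0.0;
                for (int k = 0; k <= Math.min(i, j); k++) {
                    double l = (k == i) ? 1.0 : lu[i + k * n]; // unit diagonal of L
                    s += l * lu[k + j * n];
                }
                out[i + j * n] = s;
            }
        }
        return out;
    }

    /** Applies the recorded row swaps, in order, to a copy of a, giving P*A. */
    static double[] applyPivots(double[] a, int n, int[] piv) {
        double[] out = a.clone();
        for (int k = 0; k < n; k++) {
            int p = piv[k];
            if (p != k) {
                for (int j = 0; j < n; j++) {
                    double t = out[k + j * n];
                    out[k + j * n] = out[p + j * n];
                    out[p + j * n] = t;
                }
            }
        }
        return out;
    }

    public static void main(String[] args) {
        double[] a = {4, 6, 3, 3}; // column-major form of [[4, 3], [6, 3]]
        double[] lu = a.clone();
        int[] piv = luInPlace(lu, 2);
        double[] pa = applyPivots(a, 2, piv);
        double[] prod = multiplyLU(lu, 2);
        double maxDiff = 0.0;
        for (int i = 0; i < pa.length; i++) {
            maxDiff = Math.max(maxDiff, Math.abs(pa[i] - prod[i]));
        }
        System.out.println("max |P*A - L*U| = " + maxDiff);
    }
}
```

A unit test for the GPU instruction could copy the factored matrix back from the device and compare it against this kind of reference within a tolerance, rather than comparing raw pivot arrays, since different pivot choices on ties can still yield valid factorizations.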
nakul02, do you want to take a pass at this?