Details
-
Sub-task
-
Status: Closed
-
Major
-
Resolution: Fixed
-
None
-
None
Description
1. Extend BinaryOp to enable GPU for solve
2. Add MatrixMatrixBuiltinGPUInstruction and use JCuBlas2's cublasDtrsmBatched and cublasDgeqrfBatched (or cublasDgetrfBatched) methods.
For reference implementation, see https://github.com/apache/incubator-systemml/blob/master/src/main/java/org/apache/sysml/runtime/matrix/data/LibCommonsMath.java#L97