[SYSTEMDS-446] Phase 1: Exploit GPU BLAS libraries (integration) - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Epic
Status: Reopened
Priority: Major
Resolution: Unresolved
Affects Version/s: SystemML 0.13, SystemML 0.14
Fix Version/s: SystemML 1.1
Component/s: Compiler, Runtime, Test
Labels:
None

Attachments

Sub-Tasks

1.	Implement functionality to transfer CP sparse matrixblock to GPU (and back)	Closed	Nakul Jindal
2.	Implement GPU sparse matrix multiplication	Closed	Nakul Jindal
3.	Add bufferpool integration logic to CUDA backend	Closed	Niketan Pansare
4.	Implement GPU dense matrix multiplication	Closed	Niketan Pansare
5.	Add GPU instructions that utilizes CuDNN v4's conv2d and pooling related functions	Closed	Unassigned
6.	Implement MMTSJGPUInstruction instruction for GPU backend along with corresponding Hops/Lops	Closed	Tanuj Kr Aasawat
7.	Error while allocating CSRPointer	Resolved	Nakul Jindal
8.	Add support for cusparse geam	Closed	Nakul Jindal
9.	Add support for cusparse axpy	Closed	Nakul Jindal
10.	Improve the performance of sparse TSMM either by using/implement sparse dsyrk	Open	Nakul Jindal
11.	LibMatrixCUDA's vectorScalarMultiply() produces incorrect results.	Open	Unassigned
12.	Make sparse memory estimation robust by handling unknown nnz.	Open	Nakul Jindal
13.	Conduct initial performance experiments for mat mult	Closed	Nakul Jindal
14.	Enable setting GPU from MLContext (and related APIs)	Closed	Nakul Jindal
15.	Create documentation explaining setup/usage for the GPU backend	Closed	Niketan Pansare
16.	Add LU and QR functionality to GPU backend	Open	Nakul Jindal
17.	Implement solve builtin function using cublas kernels	Closed	Nakul Jindal
18.	Add support for aggregate unary operations on GPU	Closed	Unassigned
19.	Implement relu_maxpooling instruction for GPU	Closed	Niketan Pansare
20.	Implement conv2d_bias_add instruction for GPU	Closed	Niketan Pansare
21.	Implement conv2d_bias_add instruction for GPU	Closed	Niketan Pansare
22.	Implement Mathematical and Trigonometric Built-In Functions on GPU	Closed	Nakul Jindal
23.	Add support for matrix-vector GPU axpy operation	Resolved	Niketan Pansare
24.	Support GPU via Python APIs	Resolved	Niketan Pansare
25.	Support alternative algorithms for CuDNN operators such as convolution	Open	Niketan Pansare
26.	Support fused weight update operators (similar to codegen on CP)	Open	Unassigned
27.	Add (Unit) Tests for GPU functions	Closed	Nakul Jindal
28.	Fix the need to add force to -gpu always	Closed	Nakul Jindal
29.	Add additional binary element wise operations	Closed	Nakul Jindal
30.	Add relational operators for GPU	Closed	Nakul Jindal
31.	Add cbind (and rbind) GPU ops	Closed	Nakul Jindal
32.	Support left and right indexing on GPU	Open	Niketan Pansare

Activity

People

Assignee:: Niketan Pansare

Reporter:: Matthias Boehm

Votes:: 0 Vote for this issue

Watchers:: 5 Start watching this issue

Dates

Created:: 10/Jan/16 04:12

Updated:: 21/Dec/17 06:05