[SYSTEMDS-1436] Improve Sparse matrix support for GPU operations - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Task
Status: Open
Priority: Major
Resolution: Unresolved
Affects Version/s: None
Fix Version/s: None
Component/s: Runtime
Labels:
- cuda
- deeplearning
- gpu
- gsoc2017
- mentor

Description

SystemML has a preliminary set of GPU implementation for its primitive operations (Matmult, reductions, neural net operations among others). Currently, these GPU operations work when SystemML is run on a single machine (either using the Standalone mode or Spark mode). Programs written in the external DSLs (DML & PyDML) and internal DSLs (Python and Scala) can enable the use of these GPUs.

SystemML is aware of sparsity in matrix blocks and encodes them differently. It has 3 different types of Sparse formats (CSR, COO & a custom MCSR). A lot of the GPU operations are implemented for dense matrix blocks; for some GPU operations, when sparse matrices are encountered, they are first converted to dense and then sent to the GPU.

This project is to implement CUDA kernels for Sparse Matrix blocks
Operations to be implemented include reductions, element-wise operations, neural network operations among others

This project is fairly isolated from the internal compiler & optimizer, therefore a thorough knowledge of the entire system will not be needed.

Knowledge of CUDA programming is preferred.
For a initial implementation, the most efficient CUDA kernel is not required.

Rating - Medium

Mentors - nakul02, (optionally niketanpansare)

Attachments

Activity

People

Assignee:: Unassigned

Reporter:: Nakul Jindal

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 24/Mar/17 19:04

Updated:: 27/Mar/17 22:31