[SINGA-134] Extend SINGA to run over a GPU cluster - ASF JIRA

XML

Word

Printable

JSON

Details

Type: New Feature
Status: Resolved
Priority: Major
Resolution: Fixed
Component/s: None
Labels:
- GPU
- cluster

Description

Currently SINGA is able to run over a cluster of nodes using CPU and over a single node with multiple GPUs.
This ticket is going to extend SINGA to run over a GPU cluster.
The framework is applicable for such training environment.
We need to update the code for allocating the GPU workers on different nodes and for messaging passing between GPUs on different nodes (refer to ~~SINGA-133~~).

Attachments

Activity

People

Assignee:: wangwei

Reporter:: wangwei

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 12/Jan/16 03:59

Updated:: 10/Jun/16 05:14

Resolved:: 10/Jun/16 05:14