[SINGA-381] Update the autograd API to yeild the gradients - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Open
Priority: Major
Resolution: Unresolved
Component/s: None
Labels:
None

Description

This ticket is going to update one API in autograd.py

def backward(y, dy=None)
# returns the gradient tensor one by one using yield

def gradients(y, dy=None)
# returns a dictionary: param tensor -> gradient tensor

With the backward() API, we can update the param immediately after its gradient is available. Then, the gradient tensor can be deleted and the memory is released.

The gradients() API keeps all gradient tensors in the memory, which incurs memory overhead.

Attachments

Activity

People

Assignee:: Unassigned

Reporter:: wangwei

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 11/Jul/18 07:18

Updated:: 11/Jul/18 08:29