Details
-
Sub-task
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
-
None
-
None
Description
This task aims to improve the performance of workers. Currently, in each iteration of mini-batch, we need to slice the matrix, execute the gradients computation and then send them to the ps for updating the model. While the ps is doing the aggregation work, the worker pauses due to waiting for the new model. Hence the idea is to completely use this free slot to pre-fetch the mini-batch in order to accelerate the future iteration.