Details
-
Sub-task
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
-
None
-
None
Description
It aims to implement the parameter exchange between ps and workers. We could leverage netty framework to implement our own Rpc framework. In general, the netty TransportClient and TransportServer provides the sending and receiving service for ps and workers. Extending the RpcHandler allows to invoke the corresponding ps method (i.e., push/pull method) by handling the different input Rpc call object. And then the SparkPsProxy wrapping TransportClient allows the workers to execute the push/pull call to server. At the same time, the ps netty server also provides the file repository service which allows the workers to download the partitioned training data, so that the workers could rebuild the matrix object with the transfered file instead of broadcasting all the files with spark which are not all necessary for each worker.
Attachments
Attachments
Issue Links
- duplicates
-
SYSTEMDS-2423 Implementation of spark ps
- Resolved
- Is contained by
-
SYSTEMDS-2087 Initial version of distributed spark backend
- Resolved