[SPARK-6567] Large linear model parallelism via a join and reduceByKey - ASF JIRA

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Won't Fix
Affects Version/s: None
Fix Version/s: None
Component/s: ML, MLlib
Labels: None

Description

Training a linear model requires computing, in every iteration, the dot product of each training point with the model. If the model is too large to fit in memory on a single machine, SPARK-4590 proposes using a parameter server.

There is an easier way to achieve this without parameter servers. In particular, if the data is held as a BlockMatrix and the model as an RDD, then each block can be joined with the relevant part of the model, followed by a reduceByKey to compute the dot products.

This obviates the need for a parameter server, at least for linear models. However, it's unclear how it compares performance-wise to parameter servers.
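The join-then-reduce dataflow described above can be sketched locally. The following is a minimal illustration in plain Python, not Spark code: the helper names (`split_into_blocks`, `split_model`, `blocked_dot_products`) are hypothetical, a dictionary lookup stands in for the join, and a running sum per row key stands in for reduceByKey. It assumes the data is cut into blocks keyed by (row block, column block) and the model into slices keyed by column block.

```python
from collections import defaultdict


def split_into_blocks(matrix, block_rows, block_cols):
    """Yield ((row_block, col_block), sub_matrix) pairs, BlockMatrix-style."""
    n_rows, n_cols = len(matrix), len(matrix[0])
    for r0 in range(0, n_rows, block_rows):
        for c0 in range(0, n_cols, block_cols):
            sub = [row[c0:c0 + block_cols] for row in matrix[r0:r0 + block_rows]]
            yield (r0 // block_rows, c0 // block_cols), sub


def split_model(weights, block_cols):
    """Yield (col_block, weight_slice) pairs, standing in for the model RDD."""
    for c0 in range(0, len(weights), block_cols):
        yield c0 // block_cols, weights[c0:c0 + block_cols]


def blocked_dot_products(matrix, weights, block_rows, block_cols):
    # "Join": pair every data block with the model slice that shares
    # its column-block index. No worker ever needs the whole model.
    model_slices = dict(split_model(weights, block_cols))
    partials = []
    for (rb, cb), sub in split_into_blocks(matrix, block_rows, block_cols):
        w = model_slices[cb]
        for i, row in enumerate(sub):
            global_row = rb * block_rows + i
            # Partial dot product over this block's columns only.
            partials.append((global_row, sum(x * wj for x, wj in zip(row, w))))

    # "reduceByKey": sum the partial dot products that share a row index.
    totals = defaultdict(float)
    for row_index, value in partials:
        totals[row_index] += value
    return [totals[i] for i in range(len(matrix))]


X = [
    [1.0, 2.0, 3.0, 4.0],
    [0.0, 1.0, 0.0, 1.0],
    [2.0, 0.0, 2.0, 0.0],
]
w = [0.5, -1.0, 2.0, 0.25]
margins = blocked_dot_products(X, w, block_rows=2, block_cols=2)
```

In an actual Spark job the partial products would be shuffled by row key, so the cost of that shuffle relative to parameter-server traffic is the open performance question noted above.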

Attachments

model-parallelism.pptx (243 kB), attached 21/Apr/15 07:41 by hucheng zhou

Issue Links

is superseded by

SPARK-10078 Vector-free L-BFGS

Resolved

relates to

SPARK-6932 A Prototype of Parameter Server

Resolved

SPARK-4590 Early investigation of parameter server

Resolved

People

Assignee: Unassigned

Reporter: Reza Zadeh

Shepherd: Xiangrui Meng

Votes: 2

Watchers: 25

Dates

Created: 27/Mar/15 08:27

Updated: 27/Mar/17 00:01

Resolved: 28/Feb/17 10:32