[SPARK-10014] ML model broadcasts should be stored in private vars - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Umbrella
Status: Closed
Priority: Minor
Resolution: Invalid
Affects Version/s: None
Fix Version/s: None
Component/s: ML, MLlib
Labels:
None

Target Version/s:

1.6.0

Description

Multiple places in MLlib, we broadcast a model before prediction. Since prediction may be called many times, we should store the broadcast variable in a private var so that we broadcast at most once.

I'll link subtasks for each problem case I find.

Attachments

Sub-Tasks

1.	ML model broadcasts should be stored in private vars: spark.ml tree ensembles	Closed	Unassigned
2.	ML model broadcasts should be stored in private vars: spark.ml Word2Vec	Closed	Unassigned
3.	ML model broadcasts should be stored in private vars: mllib NaiveBayes	Closed	Unassigned
4.	ML model broadcasts should be stored in private vars: mllib clustering	Closed	Unassigned
5.	ML model broadcasts should be stored in private vars: mllib IDFModel	Closed	Unassigned
6.	ML model broadcasts should be stored in private vars: mllib GeneralizedLinearModel	Closed	Unassigned

Activity

People

Assignee:: Unassigned

Reporter:: Joseph K. Bradley

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Dates

Created:: 16/Aug/15 04:16

Updated:: 11/Sep/15 20:43

Resolved:: 11/Sep/15 20:43