Description
This is the first step of the parent task, Optimizations for ML Pipeline Tuning, to perform model evaluation in parallel. A simple approach is to naively evaluate models in parallel, with a parameter to control the level of parallelism (see the sketch after this list). There are two main concerns with this approach:
- excessive caching of datasets
- what to set as the default level of parallelism: a value of 1 evaluates all models serially, as is done currently, while higher values could lead to excessive caching.
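For illustration, here is a minimal sketch of what such a knob could look like on CrossValidator, assuming a `parallelism` param is added as this issue proposes. The `setParallelism(2)` call and the toy dataset are assumptions for this sketch, not a committed API.

```scala
import org.apache.spark.ml.classification.LogisticRegression
import org.apache.spark.ml.evaluation.BinaryClassificationEvaluator
import org.apache.spark.ml.linalg.Vectors
import org.apache.spark.ml.tuning.{CrossValidator, ParamGridBuilder}
import org.apache.spark.sql.SparkSession

object ParallelTuningSketch {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .master("local[4]")
      .appName("ParallelTuningSketch")
      .getOrCreate()
    import spark.implicits._

    // Tiny toy dataset: (label, features).
    val training = Seq(
      (0.0, Vectors.dense(0.0, 1.1)),
      (1.0, Vectors.dense(2.0, 1.0)),
      (0.0, Vectors.dense(0.1, 1.2)),
      (1.0, Vectors.dense(2.2, 0.9)),
      (0.0, Vectors.dense(0.2, 1.0)),
      (1.0, Vectors.dense(1.9, 1.1)),
      (0.0, Vectors.dense(0.3, 1.3)),
      (1.0, Vectors.dense(2.1, 0.8))
    ).toDF("label", "features")

    val lr = new LogisticRegression()
    val grid = new ParamGridBuilder()
      .addGrid(lr.regParam, Array(0.01, 0.1))
      .addGrid(lr.maxIter, Array(10, 50))
      .build()

    val cv = new CrossValidator()
      .setEstimator(lr)
      .setEvaluator(new BinaryClassificationEvaluator())
      .setEstimatorParamMaps(grid)
      .setNumFolds(2)
      // Proposed knob: how many models to fit and evaluate concurrently.
      // 1 preserves the current serial behavior; higher values can speed up
      // tuning but may cache more copies of the training data at once.
      .setParallelism(2)

    val model = cv.fit(training)
    println(s"Best average metric: ${model.avgMetrics.max}")
    spark.stop()
  }
}
```

With parallelism set to 1 this degenerates to today's serial loop over the param grid, which is why 1 is the conservative default candidate.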
Issue Links
- is related to
  - SPARK-21911 Parallel Model Evaluation for ML Tuning: PySpark (Resolved)
  - SPARK-22126 Fix model-specific optimization support for ML tuning (Resolved)
- relates to
  - SPARK-21027 Parallel One vs. Rest Classifier (Resolved)
  - SPARK-19979 [MLLIB] Multiple Estimators/Pipelines In CrossValidator (Resolved)