[SPARK-19071] Optimizations for ML Pipeline Tuning - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Incomplete
Affects Version/s: None
Fix Version/s: None
Component/s: ML
Labels:
- bulk-closed

Description

This is a parent task to plan the addition of optimizations in ML tuning for parallel model evaluation and more efficiency with pipelines. They will benefit Crossvalidator and TrainValidationSplit when performing a parameter grid search. The proposal can be broken into 3 steps in order of simplicity:

1. Add ability to evaluate models in parallel.

2. Optimize param grid for pipelines, as described in ~~SPARK-5844~~

3. Add parallel model evaluation to the optimized pipelines from step 2

See the linked design document for details on the proposed implementation.

Attachments

Issue Links

contains

SPARK-5844 Optimize Pipeline.fit for ParamGrid

Resolved

supercedes

SPARK-14084 Parallel training jobs in model selection

Resolved

links to

Design Doc

Sub-Tasks

There are no Sub-Tasks for this issue.

Activity

People

Assignee:: Unassigned

Reporter:: Bryan Cutler

Votes:: 1 Vote for this issue

Watchers:: 9 Start watching this issue

Dates

Created:: 04/Jan/17 00:29

Updated:: 21/May/19 04:14

Resolved:: 21/May/19 04:14