[SPARK-17748] One-pass algorithm for linear regression with L1 and elastic-net penalties - ASF JIRA

Log work

Agile Board

Rank to Top

Rank to Bottom

Attach files

Attach Screenshot

Bulk Copy Attachments

Bulk Move Attachments

Voters

Watch issue

Watchers

Create sub-task

Convert to sub-task

Move

Link

Clone

Labels

Update Comment Author

Replace String in Comment

Update Comment Visibility

Delete Comments

Delete

XML

Word

Printable

JSON

Details

Type: New Feature
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 2.1.0
Component/s: ML
Labels:
None

Target Version/s:

2.1.0

Description

Currently linear regression uses weighted least squares to solve the normal equations locally on the driver when the dimensionality is small (<4096). Weighted least squares uses a Cholesky decomposition to solve the problem with L2 regularization (which has a closed-form solution). We can support L1/elasticnet penalties by solving the equations locally using OWL-QN solver.

Also note that Cholesky does not handle singular covariance matrices, but L-BFGS and OWL-QN are capable of providing reasonable solutions. This patch can also add support for solving singular covariance matrices by also adding L-BFGS.