[SPARK-14707] Linear algebra: clarify light vs heavy constructors and accessors - ASF JIRA

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Incomplete
Affects Version/s: None
Fix Version/s: None
Component/s: ML
Labels:
- bulk-closed

Description

MLlib linear algebra provides methods for constructing Vectors and Matrices and for accessing the vector/matrix data. There are currently two kinds of constructors and accessors:

- light: avoid copying and validating data; useful for converting between MLlib types and numpy, Breeze, etc.
- heavy: copy or validate data; useful for constructing MLlib types from user inputs
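To make the distinction concrete, here is a minimal Python sketch. `DenseVec` and its methods `wrap`, `from_user`, `values_view`, and `to_list` are hypothetical names invented for illustration and are not part of MLlib; the finiteness check is likewise only an assumed example of what "validation" might mean.

```python
import math
from typing import Iterable, List


class DenseVec:
    """Hypothetical stand-in for a dense vector type (not the MLlib API)."""

    def __init__(self, values: List[float]) -> None:
        # The factory methods below decide whether this list is shared or copied.
        self._values = values

    @classmethod
    def wrap(cls, values: List[float]) -> "DenseVec":
        """Light constructor: no copy and no validation; the caller's list is shared."""
        return cls(values)

    @classmethod
    def from_user(cls, values: Iterable[float]) -> "DenseVec":
        """Heavy constructor: copies the input and rejects non-finite entries."""
        copied = [float(v) for v in values]
        if not all(math.isfinite(v) for v in copied):
            raise ValueError("vector entries must be finite")
        return cls(copied)

    def values_view(self) -> List[float]:
        """Light accessor: returns the internal list itself."""
        return self._values

    def to_list(self) -> List[float]:
        """Heavy accessor: returns a fresh copy of the data."""
        return list(self._values)


data = [1.0, 2.0, 3.0]
light = DenseVec.wrap(data)       # shares storage with `data`
heavy = DenseVec.from_user(data)  # owns an independent copy

data[0] = 99.0                    # mutate the caller's list afterwards
shared_first = light.values_view()[0]  # reflects the mutation
copied_first = heavy.to_list()[0]      # unaffected by the mutation
```

In this sketch the light path is cheap but exposes the vector to later changes in the caller's data, while the heavy path pays for a copy and a check in exchange for isolation. That trade-off is why it matters to document which kind each operation is.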

MLlib is inconsistent about these and does not document which operations are light and which are heavy. This JIRA is for:

- First, discussing which operations should be light and which heavy, in order to choose a consistent API
- Next, creating subtasks for Scala and Python to update the implementations and clarify the docs

Issue Links

is blocked by

SPARK-13944 Separate out local linear algebra as a standalone module without Spark dependency

Resolved

is duplicated by

SPARK-16566 Bug in SparseMatrix multiplication with SparseVector

Closed

supersedes

SPARK-14697 mllib DenseMatrix toArray could use the internal values

Closed

People

Assignee: Unassigned

Reporter: Joseph K. Bradley

Votes: 0

Watchers: 3

Dates

Created: 18/Apr/16 18:01

Updated: 21/May/19 04:33

Resolved: 21/May/19 04:33