Details

    • Type: Improvement
    • Status: In Progress
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: MLlib
    • Labels: None

Description

      RowMatrix has a columnSimilarities method to find cosine similarities between columns.

      A rowSimilarities method would be useful to find similarities between rows.

      This JIRA is to investigate which algorithms are suitable for such a method, beyond brute force. Note that when there are many rows (> 10^6), brute force is unlikely to be feasible, since the output would have on the order of 10^12 entries.
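      For context, the existing column-similarity API and the proposed addition would look roughly like this; a minimal sketch assuming a live SparkContext, where the rowSimilarities signature is purely illustrative:

      import org.apache.spark.SparkContext
      import org.apache.spark.mllib.linalg.Vectors
      import org.apache.spark.mllib.linalg.distributed.{CoordinateMatrix, RowMatrix}

      def demo(sc: SparkContext): Unit = {
        val rows = sc.parallelize(Seq(
          Vectors.dense(1.0, 0.0, 2.0),
          Vectors.dense(0.0, 3.0, 1.0)))
        val mat = new RowMatrix(rows)

        // Existing method: cosine similarities between columns, returned as an
        // upper-triangular CoordinateMatrix of (i, j, similarity) entries.
        val colSims: CoordinateMatrix = mat.columnSimilarities()

        // The proposed counterpart (hypothetical signature only):
        // def rowSimilarities(): CoordinateMatrix
        // For n rows the exact output has n * (n - 1) / 2 entries, which is why
        // brute force breaks down past ~10^6 rows.
      }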

Attachments

      1. SparkMeetup2015-Experiments2.pdf (56 kB) - Debasish Das
      2. SparkMeetup2015-Experiments1.pdf (64 kB) - Debasish Das
      3. MovieLensSimilarity Comparisons.pdf (93 kB) - Debasish Das


Activity

          debasish83 Debasish Das added a comment -

          I am considering coming up with a baseline version that's very close to brute force, but where we cut the computation with a topK number: for each user, come up with the topK users, where K is defined by the client. This will take care of the matrix factorization use case.

          Basically, on the master we collect a set of user factors, broadcast it to every node, and do a reduceByKey to generate the topK users for each user from this user block. We pass a kernel function (cosine / polynomial / RBF) into this calculation.

          But this idea does not work for raw features, right? If we map features to a lower dimension using factorization, then this approach should run fine, but I am not sure we can ask users to map their data into a lower dimension.

          Is it possible to bring in ideas from Fastfood and Random Kitchen Sinks to do this?
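          A minimal sketch of that baseline idea, assuming an RDD of (id, factor) pairs and a pluggable kernel; every name below is illustrative rather than taken from an actual PR:

          import org.apache.spark.SparkContext
          import org.apache.spark.rdd.RDD

          object TopKBaseline {
            // Hypothetical kernel signature; a real abstraction may differ.
            type Kernel = (Array[Double], Array[Double]) => Double

            val cosine: Kernel = (u, v) => {
              val dot = (u, v).zipped.map(_ * _).sum
              val norms = math.sqrt(u.map(x => x * x).sum) *
                math.sqrt(v.map(x => x * x).sum)
              if (norms == 0.0) 0.0 else dot / norms
            }

            // Broadcast all user factors, then keep only the topK neighbors per
            // user. Feasible only while the factor matrix (users x rank) fits on
            // a single node.
            def topKNeighbors(sc: SparkContext,
                              factors: RDD[(Long, Array[Double])],
                              kernel: Kernel,
                              k: Int): RDD[(Long, Seq[(Long, Double)])] = {
              val all = sc.broadcast(factors.collect())
              factors.map { case (i, u) =>
                val neighbors = all.value
                  .filter { case (j, _) => j != i }   // skip self-similarity
                  .map { case (j, v) => (j, kernel(u, v)) }
                  .sortBy(-_._2)
                  .take(k)
                  .toSeq
                (i, neighbors)
              }
            }
          }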

          debasish83 Debasish Das added a comment -

          Sean Owen, did you implement map-reduce row similarities for user factors? What's the algorithm that you used? Any pointers would be really helpful.

          srowen Sean Owen added a comment -

          I don't think MapReduce matters here. You can compute pairs of similarities with any framework, or try to do it on the fly. It's no different from column similarities, right? I don't think there's anything more to it than applying a similarity metric to all pairs of vectors. I think the JIRA is about exposing a method just for API convenience, not because it's conceptually different.
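          As a concrete illustration of the all-pairs view, a brute-force sketch over an RDD of indexed rows (names are illustrative); it is O(n^2) and only viable for small row counts:

          import org.apache.spark.mllib.linalg.Vector
          import org.apache.spark.rdd.RDD

          // Keep only the upper triangle (i < j) to avoid duplicate pairs.
          def allPairs(rows: RDD[(Long, Vector)],
                       sim: (Vector, Vector) => Double): RDD[((Long, Long), Double)] =
            rows.cartesian(rows)
              .filter { case ((i, _), (j, _)) => i < j }
              .map { case ((i, u), (j, v)) => ((i, j), sim(u, v)) }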

          rezazadeh Reza Zadeh added a comment -

          Given that we're talking about RowMatrices, computing rowSimilarities the same way as columnSimilarities would require transposing the matrix, which is dangerous when the original matrix has many rows. RowMatrix assumes a single row should fit in memory on a single machine, but this might not happen after transposing a RowMatrix.

          debasish83 Debasish Das added a comment -

          Even for matrix factorization, userFactors is users x rank. With a modest rank of 50 and users at 10M, I don't think it is possible to transpose the matrix and run column similarities. And doing it on the fly is still O(n^2) complexity-wise, right? With n = 10^7 users, that's on the order of 5 x 10^13 pairs.

          debasish83 Debasish Das added a comment - edited

          Xiangrui Meng I need level-3 BLAS for this JIRA as well as for https://issues.apache.org/jira/browse/SPARK-4675. Specifically, I am looking for dense matrix x dense matrix and dense matrix x sparse matrix products. Does breeze's CSCMatrix support a BLAS-3-based dense matrix x CSCMatrix product? I had some code using the breeze dot, and it was extremely slow. I will migrate the code to the netlib-java BLAS from mllib and update the results on the JIRA.
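          For the dense x dense case, netlib-java exposes dgemm directly; a minimal sketch of a column-major level-3 call (the dense x sparse case has no stock BLAS-3 routine, which is the open question above):

          import com.github.fommil.netlib.BLAS

          // C = alpha * A * B + beta * C with A (m x k), B (k x n), C (m x n),
          // all stored column-major.
          val (m, n, k) = (2, 2, 3)
          val a = Array(1.0, 4.0, 2.0, 5.0, 3.0, 6.0)    // A = [[1,2,3],[4,5,6]]
          val b = Array(7.0, 9.0, 11.0, 8.0, 10.0, 12.0) // B = [[7,8],[9,10],[11,12]]
          val c = new Array[Double](m * n)
          BLAS.getInstance().dgemm("N", "N", m, n, k, 1.0, a, m, b, k, 0.0, c, m)
          // c == Array(58.0, 139.0, 64.0, 154.0), i.e. A * B in column-major order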

          debasish83 Debasish Das added a comment - edited

          I implemented the idea I mentioned above using level-1 BLAS, since I abstract a kernel out and wanted the code to support distributed multiply of two matrices, the kernel abstraction, and both sparse and dense vectors. In the future, for the dense-dense case we can use some level-3 BLAS. The code is written in blocked form.

          On the Netflix dataset, we run rowSimilarity with a CosineKernel on 20 nodes (4 cores, 16 GB per node) in 500 seconds. If I go from the raw data to a reduced dimension and then run rowSimilarity with the CosineKernel, it runs in 320 seconds. colSimilarity without DIMSUM sampling had run for 28 minutes when I killed the job. For matrices that are not Twitter-tall, but say 100M rows with 1-10M columns, I feel this code will work well.

          The next trick for this flow is LSH and Random Kitchen Sinks. The code is going through legal review, and I will open the PR soon for reviews. This code will also bring kernel generation to mllib.

          I will also add an examples.MovieLensSimilarity that compares colSimilarity with a threshold (DIMSUM sampling will be activated), rowSimilarity, and rowSimilarity with the dimension reduced by ALS. My experiments so far show a 40% intersection between raw similarity and ALS implicit on MovieLens, and 24% on the Netflix dataset, in predicting the topK items for every item. Surprising, but I think going to a rank larger than 50/100 is the way to close this gap; LSH and factorization will both try to do that.
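          The kernel abstraction described above might look roughly like the following; this is an illustrative sketch, not the PR's actual code:

          import org.apache.spark.mllib.linalg.{Vector, Vectors}

          // Pluggable similarity kernel over mllib vectors (hypothetical names).
          trait VectorKernel extends Serializable {
            def compute(u: Vector, v: Vector): Double
          }

          object ProductKernel extends VectorKernel {
            // Plain dot product over densified copies; level-1 BLAS speed at best.
            def compute(u: Vector, v: Vector): Double = {
              val du = u.toArray
              val dv = v.toArray
              var i = 0
              var s = 0.0
              while (i < du.length) { s += du(i) * dv(i); i += 1 }
              s
            }
          }

          object CosineKernel extends VectorKernel {
            def compute(u: Vector, v: Vector): Double = {
              val denom = Vectors.norm(u, 2) * Vectors.norm(v, 2)
              if (denom == 0.0) 0.0 else ProductKernel.compute(u, v) / denom
            }
          }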

          apachespark Apache Spark added a comment -

          User 'debasish83' has created a pull request for this issue:
          https://github.com/apache/spark/pull/6213

          debasish83 Debasish Das added a comment -

          I opened a PR that worked well for our datasets. It is still a brute-force computation, although we use a blocked cartesian and user-defined kernels to cut computation and shuffle. There are straightforward ways to go from BLAS-1 to BLAS-2 and BLAS-3 as more sparse operations are added to the mllib BLAS, although I don't think that will give us the runtime boost we are looking for.

          We are looking into the approximate-KNN family of algorithms to improve the runtime further. KD-trees are good for dense vectors with few features, but researchers have not found them useful for sparse vectors in higher dimensions.

          LSH seems to be the most commonly used, and that's the direction we are looking into. Among the papers I looked at, the one that showed good recall compared to brute-force KNN is Google Correlate, and that's the validation strategy we will focus on: https://www.google.com/trends/correlate/nnsearch.pdf. Please point to any other references you deem fit. There are Twitter papers using LSH as well, and an implementation is available in Algebird. We will start with Algebird's LSH, but ideally we don't want a distance metric hardcoded in the LSH.

          If we get good recall from the LSH-based method compared to the rowSimilarities code from the PR, we will use it to approximately compute similarities between dense/sparse rows using the cosine kernel, between the dense userFactor and productFactor from factorization using the product kernel, and between dense user/product factors using the cosine kernel.

          The kernel abstraction is part of the current PR, and right now we support Cosine, Product, Euclidean, and RBF kernels. Pearson is of interest but not added yet. For approximate row similarity I will open a separate JIRA.
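          For the cosine kernel, the standard random-hyperplane LSH produces sign-bit signatures; a minimal sketch of the idea (not the Algebird implementation):

          import scala.util.Random

          // Rows hashing to the same signature become candidate neighbors;
          // the exact kernel is then computed only within each bucket.
          def randomPlanes(numPlanes: Int, dim: Int, seed: Long): Array[Array[Double]] = {
            val rng = new Random(seed)
            Array.fill(numPlanes, dim)(rng.nextGaussian())
          }

          def signature(v: Array[Double], planes: Array[Array[Double]]): Int =
            planes.zipWithIndex.foldLeft(0) { case (sig, (p, i)) =>
              val dot = (p, v).zipped.map(_ * _).sum
              if (dot >= 0.0) sig | (1 << i) else sig
            }

          // For two rows at angle theta, each bit matches with probability
          // 1 - theta / pi, so adding planes sharpens the candidate filter.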

          debasish83 Debasish Das added a comment -

          The attached file shows the runtime comparison of the row- and column-based flows on all items of the MovieLens dataset, on my local MacBook with 8 cores, a 1 GB driver, and 4 GB of executor memory.

          A threshold of 1e-2 is set for both the row-based kernel flow and the column-based DIMSUM flow.

          Stages 24-35 are the row-similarity flow. Total runtime ~20 s.

          Stage 64 is the column-similarity mapPartitions. Total runtime ~4.6 min.

          This shows the power of blocking in Spark, and I have not yet moved to gemv, which will decrease the runtime further.

          I updated the driver code in examples.mllib.MovieLensSimilarity.
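          For reference, the column-based side of this comparison uses the existing threshold overload, which is what activates DIMSUM sampling:

          import org.apache.spark.mllib.linalg.distributed.{CoordinateMatrix, RowMatrix}

          // A positive threshold switches columnSimilarities to DIMSUM sampling,
          // which prunes column pairs whose similarity likely falls below it.
          def dimsumSims(mat: RowMatrix, threshold: Double = 1e-2): CoordinateMatrix =
            mat.columnSimilarities(threshold)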

          debasish83 Debasish Das added a comment -

          We ran more detailed experiments for the July 2015 Spark Meetup to understand the effect of shuffle on runtime. I attached the experiment data to the JIRA. I will update the PR as discussed with Reza. I am targeting one PR for Spark 1.5.

          superwai Jerry Lam added a comment -

          Hi Debasish Das, I wonder if this is still a work in progress or something that can be merged into 1.5 soon? Thank you.

          debasish83 Debasish Das added a comment -

          We use it in multiple use cases internally but have not had time to refactor the PR into 3 smaller PRs. I will update the PR to target 2.0.


People

    • Assignee: Unassigned
    • Reporter: rezazadeh Reza Zadeh
    • Votes: 4
    • Watchers: 16
