MAHOUT-1272: Parallel SGD matrix factorizer for SVDRecommender


      Description

      A parallel factorizer based on MAHOUT-1089 may achieve better performance on multicore processors.

      The existing code is single-threaded, and may still be outperformed by the default ALS-WR.

      In addition, its hardcoded online-to-batch conversion prevents it from being used by an online recommender. An online SGD implementation could help build a high-performance online recommender to replace the outdated slope-one.

      The new factorizer can implement either DSGD (http://www.mpi-inf.mpg.de/~rgemulla/publications/gemulla11dsgd.pdf) or Hogwild! (http://www.cs.wisc.edu/~brecht/papers/hogwildTR.pdf).

      Related discussion has been going on for a while but remains inconclusive:
      http://web.archiveorange.com/archive/v/z6zxQUSahofuPKEzZkzl
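
      For orientation, here is a minimal single-machine sketch of the Hogwild! idea: worker threads apply SGD updates to shared factor matrices with no locking at all, relying on the sparsity of ratings to keep conflicting writes rare. All names below are illustrative, not Mahout's API.

      import java.util.Random;
      import java.util.concurrent.ExecutorService;
      import java.util.concurrent.Executors;
      import java.util.concurrent.TimeUnit;

      /** Hogwild!-style SGD sketch: each rating touches only one user row and
       *  one item row, so unsynchronized updates rarely collide, and the noise
       *  from the collisions that do happen does not prevent convergence. */
      final class HogwildSketch {

        static void factorize(final int[][] ratings,       // {userIndex, itemIndex, rating}
                              final double[][] userFeatures,
                              final double[][] itemFeatures,
                              final double mu,              // learning rate
                              final double lambda,          // regularization
                              final int epochs,
                              int numThreads) throws InterruptedException {
          ExecutorService pool = Executors.newFixedThreadPool(numThreads);
          final long stepsPerThread = (long) epochs * ratings.length / numThreads;
          for (int t = 0; t < numThreads; t++) {
            pool.execute(new Runnable() {
              @Override public void run() {
                Random random = new Random();
                for (long n = 0; n < stepsPerThread; n++) {
                  int[] r = ratings[random.nextInt(ratings.length)];
                  double[] u = userFeatures[r[0]];
                  double[] v = itemFeatures[r[1]];
                  double err = r[2] - dot(u, v);
                  for (int k = 0; k < u.length; k++) {      // unsynchronized on purpose
                    double uk = u[k];
                    u[k] += mu * (err * v[k] - lambda * uk);
                    v[k] += mu * (err * uk - lambda * v[k]);
                  }
                }
              }
            });
          }
          pool.shutdown();
          pool.awaitTermination(1, TimeUnit.DAYS);
        }

        private static double dot(double[] a, double[] b) {
          double sum = 0;
          for (int i = 0; i < a.length; i++) {
            sum += a[i] * b[i];
          }
          return sum;
        }
      }

      DSGD differs in that it partitions the rating matrix into blocks and schedules non-overlapping blocks onto workers, so updates never conflict by construction.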

      Attachments

      1. GroupLensSVDRecomenderEvaluatorRunner.java
        4 kB
        Peng Cheng
      2. libimsetiSVDRecomenderEvaluatorRunner.java
        5 kB
        Peng Cheng
      3. mahout.patch
        23 kB
        Peng Cheng
      4. NetflixRecomenderEvaluatorRunner.java
        5 kB
        Peng Cheng
      5. ParallelSGDFactorizer.java
        14 kB
        Peng Cheng
      6. ParallelSGDFactorizer.java
        12 kB
        Peng Cheng
      7. ParallelSGDFactorizerTest.java
        11 kB
        Peng Cheng
      8. ParallelSGDFactorizerTest.java
        10 kB
        Peng Cheng

        Activity

        Sebastian Schelter added a comment -

        Are you referring to a single-machine multi-core implementation or a MapReduce implementation?

        Peng Cheng added a comment -

        I presume it to be single-machine multi-core? Many people in the discussion have voted against iterative MR. Not sure though...

        Peng Cheng added a comment -

        I'm reading the source code of ALS-WR; apparently it uses an ExecutorService to distribute ALS to each core.
        There is no MR here. I just started using it a few days ago. Please correct me if I'm wrong.

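        For illustration, the general pattern looks like this (a sketch, not ALSWRFactorizer's actual code; solveForUser is a hypothetical stand-in for the regularized least-squares solve):

        import java.util.concurrent.ExecutorService;
        import java.util.concurrent.Executors;
        import java.util.concurrent.TimeUnit;

        final class AlsParallelismSketch {

          /** One ALS half-step: item features are held fixed while the pool's
           *  worker threads recompute the user feature vectors in parallel. */
          static void recomputeUserFeatures(final double[][] userFeatures,
                                            final double[][] itemFeatures,
                                            int numCores) throws InterruptedException {
            ExecutorService pool = Executors.newFixedThreadPool(numCores);
            for (int u = 0; u < userFeatures.length; u++) {
              final int userIndex = u;
              pool.execute(new Runnable() {
                @Override public void run() {
                  // hypothetical helper: solves the k-by-k regularized
                  // least-squares system from this user's observed ratings
                  userFeatures[userIndex] = solveForUser(userIndex, itemFeatures);
                }
              });
            }
            pool.shutdown();
            pool.awaitTermination(1, TimeUnit.DAYS);
          }

          static double[] solveForUser(int userIndex, double[][] itemFeatures) {
            throw new UnsupportedOperationException("stand-in for the normal-equation solve");
          }
        }

        Each task here owns the single row it writes, so no two threads ever touch the same memory, which is why ALS parallelizes so cleanly.
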
        Sebastian Schelter added a comment -

        There is also an MR version of ALS. But I agree that it would be better to start with a single-machine implementation of DSGD or Hogwild. If it's faster than ALS-WR, it would be a good replacement for RatingSGDFactorizer and ALSWRFactorizer. What do you think?

        Peng Cheng added a comment -

        Thanks a lot for the hint! Is it in org.apache.mahout.math.als? I can't find any other implementation in core-0.7.
        Yeah, I think this would be good practice to start with, regardless of whether it has any performance edge.
        I'll try to do something this weekend.

        Sebastian Schelter added a comment -

        You should use trunk; lots of things have been improved. Take your time working on the code, there is no need to hurry.

        Peng Cheng added a comment -

        The learning rate/step size is set to be identical to the ~.classifier.sgd package. The old learning rate is exponential with a constant decay factor; this setting seems to work only for smooth functions (proved by Nesterov?), and I'm not sure that holds in CF. Otherwise, use either 1/sqrt(n) for convex f or 1/n for strongly convex f.

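        For reference, the three step-size schedules mentioned above, written out (generic parameter names, not those of ~.classifier.sgd):

        /** Common SGD step-size schedules; n counts the update steps taken so far. */
        final class StepSizes {
          // exponential decay with a constant factor d < 1
          static double exponential(double mu0, double d, long n) {
            return mu0 * Math.pow(d, n);
          }
          // O(1/sqrt(n)) schedule, the standard choice for (merely) convex f
          static double convex(double mu0, long n) {
            return mu0 / Math.sqrt(n + 1);
          }
          // O(1/n) schedule for strongly convex f
          static double stronglyConvex(double mu0, long n) {
            return mu0 / (n + 1);
          }
        }
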
        Peng Cheng added a comment -

        Looks like the 1/n learning rate doesn't work at all on the SGD factorizer; maybe the convergence results from stochastic optimization can't be applied to the non-convex MF problem. Can someone show me a paper discussing convergence bounds for this problem? Much appreciated.

        Peng Cheng added a comment - edited

        Hey, I have finished the class and test for a parallel SGD factorizer for matrix-completion-based recommenders (not MapReduce, just single-machine multi-threaded); it is loosely based on vanilla SGD and Hogwild!. I have only tested it on toy and synthetic data (2000 users × 1000 items), but it is pretty fast: 3-5x faster than vanilla SGD with 8 cores (never exceeding 6x; apparently the executor incurs a high allocation overhead), and definitely faster than single-machine ALS-WR.

        I'm submitting my java files and patch for review.

        Peng Cheng added a comment -

        java file

        Peng Cheng added a comment -

        patch

        Peng Cheng added a comment -

        The next step would be to create an online version of this (and of the recommender).
        SGD is an online algorithm, but right now it works only for batch recommenders.
        In the meantime, the only online recommender in Mahout is slope-one, which is kind of a shame.
        I will create a new JIRA ticket tomorrow.

        Sebastian Schelter added a comment -

        Hello Peng,

        the code looks very good at first glimpse. I'd like you to work on it a little more, though. Can you format the files according to our code conventions (e.g. no tabs, 2-space indent, no braces on the next line, etc.)? The code conventions are basically Oracle's standard conventions with 120 chars per line instead of 80.

        Furthermore, could you benchmark your code via a holdout test on a known dataset, maybe movielens1M or movielens10M? That would be awesome. I think this is going to be a great contribution.

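        For reference, a holdout benchmark with the Taste evaluator could look roughly like this (a sketch: the FileDataModel input must be comma- or tab-separated userID,itemID,rating triples, and the ParallelSGDFactorizer constructor arguments shown are assumptions, so check the class for its actual signature):

        import java.io.File;
        import org.apache.mahout.cf.taste.common.TasteException;
        import org.apache.mahout.cf.taste.eval.RecommenderBuilder;
        import org.apache.mahout.cf.taste.impl.eval.RMSRecommenderEvaluator;
        import org.apache.mahout.cf.taste.impl.model.file.FileDataModel;
        import org.apache.mahout.cf.taste.impl.recommender.svd.ParallelSGDFactorizer;
        import org.apache.mahout.cf.taste.impl.recommender.svd.SVDRecommender;
        import org.apache.mahout.cf.taste.model.DataModel;
        import org.apache.mahout.cf.taste.recommender.Recommender;

        public class HoldoutBenchmark {
          public static void main(String[] args) throws Exception {
            DataModel model = new FileDataModel(new File("movielens-ratings.csv"));
            RecommenderBuilder builder = new RecommenderBuilder() {
              @Override
              public Recommender buildRecommender(DataModel dataModel) throws TasteException {
                // assumed constructor: (dataModel, numFeatures, lambda, numEpochs)
                return new SVDRecommender(dataModel,
                    new ParallelSGDFactorizer(dataModel, 50, 1e-10, 2));
              }
            };
            // train on 90% of each user's ratings, compute RMSE on the held-out 10%
            double rmse = new RMSRecommenderEvaluator().evaluate(builder, null, model, 0.9, 1.0);
            System.out.println("RMSE: " + rmse);
          }
        }
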
        Peng Cheng added a comment -

        Aye aye, more tests on the way. Much obliged for the quick suggestion.

        Peng Cheng added a comment -

        Hey honoured contributors, I've got some crude test results for the new parallel SGD factorizer for CF:

        1. parameters:
        lambda = 1e-10
        rank of the rating matrix / number of features per user/item vector = 50
        number of biases: 3 (average rating + user bias + item bias)
        number of iterations/epochs = 2 (for all factorizers including ALSWR, ratingSGD and the proposed parallelSGD)
        initial mu/learning rate = 0.01 (for ratingSGD and proposed parallelSGD)
        decay rate of mu = 1 (does not decay) (for ratingSGD and proposed parallelSGD)
        other parameters are set to default.

        2. result on movielens-10m (I don't know what the hell happened to ALSWR, the default hyperparameters must be screwing it up badly, but my point is the speed edge):
        a. RMSE

        Jul 07, 2013 5:20:23 PM org.slf4j.impl.JCLLoggerAdapter info
        INFO: ==================Recommender With ALSWRFactorizer: 3.7709163950800665E21 time spent: 6.179s===================
        Jul 07, 2013 5:20:23 PM org.slf4j.impl.JCLLoggerAdapter info
        INFO: ==================Recommender With RatingSGDFactorizer: 0.8847393972529887 time spent: 6.179s===================
        Jul 07, 2013 5:20:23 PM org.slf4j.impl.JCLLoggerAdapter info
        INFO: ==================Recommender With ParallelSGDFactorizer: 0.8805947464818478 time spent: 3.084s====================

        b. Absolute Average

        INFO: ==================Recommender With ALSWRFactorizer: 1.2085420449917682E19 time spent: 7.444s===================
        Jul 07, 2013 5:22:39 PM org.slf4j.impl.JCLLoggerAdapter info
        INFO: ==================Recommender With RatingSGDFactorizer: 0.6757777685274206 time spent: 7.444s===================
        Jul 07, 2013 5:22:39 PM org.slf4j.impl.JCLLoggerAdapter info
        INFO: ==================Recommender With ParallelSGDFactorizer: 0.6775774766740665 time spent: 2.365s====================

        3. result on movielens-1m (on average SGD works worse here than on movielens-10m; perhaps I should use more iterations/epochs)

        a. RMSE

        Jul 07, 2013 5:26:04 PM org.slf4j.impl.JCLLoggerAdapter info
        INFO: ==================Recommender With ALSWRFactorizer: 1.3514189134383086E20 time spent: 0.637s===================
        Jul 07, 2013 5:26:04 PM org.slf4j.impl.JCLLoggerAdapter info
        INFO: ==================Recommender With RatingSGDFactorizer: 0.9312989913558529 time spent: 0.637s===================
        Jul 07, 2013 5:26:04 PM org.slf4j.impl.JCLLoggerAdapter info
        INFO: ==================Recommender With ParallelSGDFactorizer: 0.9529995632658007 time spent: 0.305s====================

        b. Absolute Average

        Jul 07, 2013 5:25:29 PM org.slf4j.impl.JCLLoggerAdapter info
        INFO: ==================Recommender With ALSWRFactorizer: 1.58934499216789965E18 time spent: 0.626s===================
        Jul 07, 2013 5:25:29 PM org.slf4j.impl.JCLLoggerAdapter info
        INFO: ==================Recommender With RatingSGDFactorizer: 0.7459565635961599 time spent: 0.626s===================
        Jul 07, 2013 5:25:29 PM org.slf4j.impl.JCLLoggerAdapter info
        INFO: ==================Recommender With ParallelSGDFactorizer: 0.7420818642753416 time spent: 0.297s====================

        Many thanks to Sebastian for his guidance. I'll upload the EvaluatorRunner class as a mahout-examples component and the formatted code shortly.

        Peng Cheng added a comment -

        My laptop is an HP Pavilion with an Intel® Core™ i7-3610QM CPU @ 2.30GHz × 8 and 8 GB of memory.

        Peng Cheng added a comment -

        Hi Sebastian, may I ask a question? I dug up some old posts and found that the best result should be RMSE ≈ 0.85; do you know the parameters that were used?

        Peng Cheng added a comment - edited

        New parameters:
        lambda = 0.001
        rank of the rating matrix / number of features per user/item vector = 5
        number of iterations/epochs = 20

        result on movielens-10m; all evaluations use RMSE:
        Jul 07, 2013 6:18:57 PM org.slf4j.impl.JCLLoggerAdapter info
        INFO: ==================Recommender With RatingSGDFactorizer: 0.8119081937625745 time spent: 36.509s===================
        Jul 07, 2013 6:18:57 PM org.slf4j.impl.JCLLoggerAdapter info
        INFO: ==================Recommender With ParallelSGDFactorizer: 0.8115207244832938 time spent: 8.747s====================

        This is fast and accurate enough; I'm moving on to the Netflix Prize dataset.

        Sebastian Schelter added a comment -

        Hi Peng,

        I also played with your code and the results look very good; it's blazingly fast compared to ALS (which has to solve lots of linear systems). The formatting is not completely correct, but I can take over that part. Not sure if the patch will make it into the current release (0.8), but we will definitely include it in 0.9. Thank you for this contribution.

        Peng Cheng added a comment -

        Hi Sebastian,

        Really? I would break my fingers to squeeze into the 0.8 release. (Not RC1 of course, but there is still RC2 :->) A few guys I work with are also pushing me for the online recommender, so I can work hard and undistracted. Just tell me what to do next and I'll be thrilled to oblige.

        Sebastian Schelter added a comment -

        Let's see what we can do to get this into 0.8. The online recommender will definitely be out of scope for 0.8, but it's an interesting project for 0.9!

        Hudson added a comment -

        Integrated in Mahout-Quality #2135 (See https://builds.apache.org/job/Mahout-Quality/2135/)
        MAHOUT-1272 Parallel SGD matrix factorizer for SVDrecommender (Revision 1500553)

        Result = SUCCESS
        ssc :
        Files :

        • /mahout/trunk/CHANGELOG
        • /mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/impl/recommender/svd/ParallelSGDFactorizer.java
        • /mahout/trunk/core/src/test/java/org/apache/mahout/cf/taste/impl/recommender/svd/ParallelSGDFactorizerTest.java

        Peng Cheng added a comment - edited

        Hey Sebastian, Hudson, thank you so much for pushing things that hard. I owe you one.
        I'll test more GroupLens data. Since Sebastian has taken over the code, new test cases will only be posted as code snippets.

        Peng Cheng added a comment - edited

        Test on the libimseti dataset (http://www.occamslab.com/petricek/data/); libimseti is a Czech dating website.
        This dataset was used in a live example described in the book 'Mahout in Action', page 71, written by a few guys hanging around this site.

        parameters:
        private final static double lambda = 0.1;
        private final static int rank = 16;

        private static int numALSIterations=5;
        private static int numEpochs=20;

        (for ratingSGD)
        double randomNoise=0.02;
        double learningRate=0.01;
        double learningDecayRate=1;

        (for parallelSGD)
        double mu0=1;
        double decayFactor=1;
        int stepOffset=100;
        double forgettingExponent=-1;

        result (using average absolute difference; ratings are on a 1-10 scale):

        INFO: ==================Recommender With ALSWRFactorizer: 1.5623366369454739 time spent: 41.24s=================== (note that the number of ALS iterations is much smaller than for the others, which leads to a suboptimal result, but that is not the point of this test)
        Jul 13, 2013 4:39:34 PM org.slf4j.impl.JCLLoggerAdapter info
        INFO: ==================Recommender With RatingSGDFactorizer: 1.28022379922957 time spent: 118.188s===================
        Jul 13, 2013 4:39:34 PM org.slf4j.impl.JCLLoggerAdapter info
        INFO: ==================Recommender With ParallelSGDFactorizer: 1.2798905733917445 time spent: 21.806s====================

        This is the best result I can get; the book claims a best result of 1.12 on this dataset, which I have never achieved. If you have also experimented and found a better parameter set, please post it here.

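        One plausible reading of how the four parallelSGD parameters above combine into a per-step learning rate, inferred from their names and the earlier 1/n discussion rather than from the patch itself:

        final class ScheduleSketch {
          // Hypothetical schedule inferred from the parameter names; with
          // decayFactor = 1 and forgettingExponent = -1 it reduces to
          // mu0 / (stepOffset + n), i.e. the 1/n rate with a warm-up offset.
          static double mu(double mu0, double decayFactor, int stepOffset,
                           double forgettingExponent, long n) {
            return mu0 * Math.pow(decayFactor, n) * Math.pow(stepOffset + n, forgettingExponent);
          }
        }
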
        Peng Cheng added a comment -

        Here is the component for testing on the libimseti dataset.

        Peng Cheng added a comment -

        Runnable component for testing ParallelSGDFactorizer on the Netflix training dataset (yeah, only the trainingSet generated by NetflixDatasetConverter; I cannot get judging.txt for validation, but my purpose is just to test its efficiency at extreme scale, so whatever).

        Warning! To run it without danger you need to allocate at least 12 GB of heap space to the JVM using the following VM parameters:

        -Xms12288M -Xmx12288M

        In addition, 16 GB+ of RAM is MANDATORY; otherwise either garbage collection or swap will kill you (or both). I almost burned my laptop on this (it has only 8 GB of RAM). As a result, I won't be able to post any results until I get a better machine. But since its number of ratings is about 6 times that of the movielens-10m or libimseti datasets, and SGD scales linearly in this number, I estimate the running time to be between 2.5 and 3 minutes.

        I would be much obliged to anybody who can try it and post the result here (if your machine can handle it, of course). But as Sebastian has pointed out, our FileDataModel needs some serious optimization to handle such scale.

        Hey Sebastian, can you try this out in your lab? That would be most helpful.

        Sebastian Schelter added a comment -

        I think we should rework the DataModel first. It makes no sense to have to allocate 12 GB of heap for a 1 GB dataset.


          People

          • Assignee: Sean Owen
          • Reporter: Peng Cheng
          • Votes: 0
          • Watchers: 4


              Time Tracking

              Estimated: 336h
              Remaining: 336h
              Logged: Not Specified
