[SPARK-2510] word2vec: Distributed Representation of Words - ASF JIRA

XML

Word

Printable

JSON

Details

Type: New Feature
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 1.1.0
Component/s: MLlib
Labels:
None

Target Version/s:

1.1.0

Description

We would like to add parallel implementation of word2vec to MLlib. word2vec finds distributed representation of words through training of large data sets. We will focus on skip-gram model and hierarchical softmax in our initial implementation.

Attachments

Issue Links

links to

[Github] Pull Request #1719 (Ishiihara)

Activity

People

Assignee:: Liquan Pei

Reporter:: Liquan Pei

Votes:: 1 Vote for this issue

Watchers:: 6 Start watching this issue

Dates

Created:: 16/Jul/14 03:58

Updated:: 04/Aug/14 06:58

Resolved:: 04/Aug/14 06:58

Time Tracking

Estimated:

672h

Remaining:

672h

Logged:

Not Specified