Description
We would like to add parallel implementation of word2vec to MLlib. word2vec finds distributed representation of words through training of large data sets. We will focus on skip-gram model and hierarchical softmax in our initial implementation.