Description
In Word2VecModel, wordVecNorms may contain Euclidean norms equal to zero. This causes an incorrect cosine distance calculation when computing cosineVec(ind) / wordVecNorms(ind). The cosine distance should be 0 when the norm is 0.
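For illustration, the guard could look roughly like the sketch below. This is not the actual Word2VecModel code: the object name CosineGuard, the method normalize, and the use of Array[Double] for both arrays are assumptions made for the example. Only cosineVec, wordVecNorms and ind come from the description. Note that floating-point division by a zero norm does not throw; it silently yields NaN or Infinity, which is why an explicit check is needed.

```scala
// Hypothetical helper for illustration only; not part of Spark's API.
object CosineGuard {

  /**
   * Divides each entry of cosineVec by the matching norm.
   * Entries whose norm is zero are set to 0.0 instead of NaN/Infinity.
   */
  def normalize(cosineVec: Array[Double], wordVecNorms: Array[Double]): Array[Double] = {
    require(cosineVec.length == wordVecNorms.length, "arrays must have the same length")
    val result = new Array[Double](cosineVec.length)
    var ind = 0
    while (ind < cosineVec.length) {
      val norm = wordVecNorms(ind)
      // Guard against a zero norm: define the cosine value as 0 in that case.
      result(ind) = if (norm == 0.0) 0.0 else cosineVec(ind) / norm
      ind += 1
    }
    result
  }
}
```

With this guard, a word whose vector has zero norm gets a score of 0 rather than a NaN that could distort the ranking of similar words.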
Hi, is anybody working on this (and SPARK-7617, SPARK-7618)? I have been using Spark for some time and would like to get started contributing. This looks like a trivial issue (good for a newbie). Shall I provide a pull request for this? Would it make sense to submit a pull request covering the three related issues?