Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-17548

Word2VecModel.findSynonyms can spuriously reject the best match when invoked with a vector

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • 1.4.1, 1.5.2, 1.6.2, 2.0.0
    • 2.0.1, 2.1.0
    • MLlib
    • None
    • any

    Description

      The `findSynonyms` method in `Word2VecModel` currently rejects the best match a priori. When `findSynonyms` is invoked with a word, the best match is almost certain to be that word, but `findSynonyms` can also be invoked with a vector, which might not correspond to any of the words in the model's vocabulary. In the latter case, rejecting the best match is spurious.

      Attachments

        Activity

          People

            willbenton William Benton
            willbenton William Benton
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: