Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-26315

auto cast threshold from Integer to Float in approxSimilarityJoin of BucketedRandomProjectionLSHModel

Log workAgile BoardRank to TopRank to BottomAttach filesAttach ScreenshotBulk Copy AttachmentsBulk Move AttachmentsVotersStop watchingWatchersCreate sub-taskConvert to sub-taskMoveLinkCloneLabelsUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete CommentsDelete
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • 2.3.2
    • 2.3.3, 2.4.1, 3.0.0
    • MLlib, PySpark
    • None

    Description

      when I was using 

      // code placeholder
      BucketedRandomProjectionLSHModel.approxSimilarityJoin(dt_features, dt_features, distCol="EuclideanDistance", threshold=20.)
      

      I was confused then that this method reported an exception some java method (dataset, dataset, integer, string) fingerprint can not be found.... I think if I give an integer, and the python method of pyspark should be auto-cast this to float if needed. 

      Attachments

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            cinqsong Song Ci Assign to me
            cinqsong Song Ci
            Votes:
            0 Vote for this issue
            Watchers:
            4 Stop watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment