Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-26315

auto cast threshold from Integer to Float in approxSimilarityJoin of BucketedRandomProjectionLSHModel

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 2.3.2
    • Fix Version/s: 2.3.3, 2.4.1, 3.0.0
    • Component/s: MLlib, PySpark
    • Labels:
      None

      Description

      when I was using 

      // code placeholder
      BucketedRandomProjectionLSHModel.approxSimilarityJoin(dt_features, dt_features, distCol="EuclideanDistance", threshold=20.)
      

      I was confused then that this method reported an exception some java method (dataset, dataset, integer, string) fingerprint can not be found.... I think if I give an integer, and the python method of pyspark should be auto-cast this to float if needed. 

        Attachments

          Activity

            People

            • Assignee:
              cinqsong Song Ci
              Reporter:
              cinqsong Song Ci

              Dates

              • Created:
                Updated:
                Resolved:

                Issue deployment