Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Incomplete
-
None
-
None
Description
LSHModel.approxNearestNeighbors sorts the full dataset on the hashDistance in order to find a threshold. It should use approxQuantile instead.
Attachments
Issue Links
- Is contained by
-
SPARK-18454 Changes to improve Nearest Neighbor Search for LSH
- Resolved
- is related to
-
SPARK-30120 LSH approxNearestNeighbors should use BoundedPriorityQueue when numNearestNeighbors is small
- Resolved
- links to
(2 links to)