[SPARK-10972] UDFs in SQL joins - ASF JIRA

XML

Word

Printable

JSON

Details

Type: New Feature
Status: Resolved
Priority: Major
Resolution: Incomplete
Affects Version/s: 1.5.1
Fix Version/s: None
Component/s: SQL
Labels:
- bulk-closed

Description

Currently expressions used to .join() in DataFrames are limited to column names plus the operators exposed in org.apache.spark.sql.Column.

It would be nice to be able to .join() based on a UDF, such as, say, euclideanDistance(col1, col2) < 0.1.

Attachments

Activity

People

Assignee:: Unassigned

Reporter:: Michael Malak

Votes:: 1 Vote for this issue

Watchers:: 6 Start watching this issue

Dates

Created:: 07/Oct/15 15:04

Updated:: 21/May/19 04:34

Resolved:: 21/May/19 04:34