Details
Type: Bug
Status: Resolved
Priority: Major
Resolution: Fixed
2.4.0
Description
In SPARK-25314, we added support for a Python UDF that refers to attributes from both legs of a join condition by rewriting
the plan to convert an inner join or left semi join into a filter over a cross join. For a left semi join, this transformation may
produce incorrect results when the right leg contains several rows that satisfy the join condition for the same left row: a left
semi join returns that left row once, whereas a filter over a cross join returns it once per matching right row.
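
The discrepancy can be illustrated with a minimal sketch in plain Python. It does not use Spark; the helper names (`udf_condition`, `left_semi_join`, `filter_over_cross_join`) and the sample rows are hypothetical and only model the two semantics described above.

```python
from itertools import product


def udf_condition(left_row, right_row):
    # Hypothetical stand-in for a Python UDF that reads attributes
    # from both legs of the join.
    return left_row["a"] == right_row["b"]


def left_semi_join(left, right, cond):
    # Left semi join: a left row is emitted at most once,
    # however many right rows match it.
    return [l for l in left if any(cond(l, r) for r in right)]


def filter_over_cross_join(left, right, cond):
    # Naive rewrite: cross join, filter on the condition,
    # then keep only the left-side columns.
    return [l for l, r in product(left, right) if cond(l, r)]


left = [{"a": 1}, {"a": 2}]
right = [{"b": 1}, {"b": 1}, {"b": 3}]  # two right rows match a = 1

semi = left_semi_join(left, right, udf_condition)
rewritten = filter_over_cross_join(left, right, udf_condition)

print(semi)       # [{'a': 1}]
print(rewritten)  # [{'a': 1}, {'a': 1}]
```

In this model the rewritten plan returns the left row `a = 1` twice because two right rows match it, while the left semi join returns it once.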