[SPARK-20897] cached self-join should not fail - ASF JIRA

Details

Type: Bug
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 2.2.0
Fix Version/s: 2.2.0
Component/s: SQL
Labels: None
Target Version/s: 2.2.0

Description

Code to reproduce this bug:

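// Needed for toDF and the $"..." column syntax (already in scope in spark-shell).
import spark.implicits._

// Set the broadcast threshold to 0 so that a sort merge join is planned.
spark.conf.set("spark.sql.autoBroadcastJoinThreshold", "0")
val df = Seq(1 -> "a").toDF("i", "j")
// Alias the same DataFrame twice to build a self-join.
val df1 = df.as("t1")
val df2 = df.as("t2")
// Caching the joined result and counting it should succeed and return 1 row.
assert(df1.join(df2, $"t1.i" === $"t2.i").cache().count() == 1)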

Issue Links

links to: [GitHub] Pull Request #18121 (cloud-fan)

People

Assignee: Wenchen Fan
Reporter: Wenchen Fan
Votes: 0
Watchers: 3

Dates

Created: 26/May/17 15:06
Updated: 27/May/17 23:18
Resolved: 27/May/17 23:18