[SPARK-9874] UnionAll operation on DataFrame doesn't check for column names - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Resolved
Priority: Major
Resolution: Duplicate
Affects Version/s: 1.4.0
Fix Version/s: None
Component/s: SQL
Labels:
None

Description

UnionAll operation in dataFrame checks only for the column dataType. For example if df1 has a field id of type String and df2 has a field city of type String then, unionAll appends both dataFrames one after another.
This should not be allowed. Either it should create combined schema or it should throw error.

Attachments

Issue Links

duplicates

SPARK-9813 Incorrect UNION ALL behavior

Resolved

is related to

SPARK-15918 unionAll returns wrong result when two dataframes has schema in different order

Resolved

Activity

People

Assignee:: Unassigned

Reporter:: Raghavendra Kumar Pandey

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 12/Aug/15 09:12

Updated:: 13/Jun/16 21:58

Resolved:: 12/Aug/15 14:14