Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Incomplete
-
2.1.0
-
None
Description
Creating a union DataFrame over two sources that have different schema definitions is surprisingly complex. Provide a version of the union method that will create a infer a target schema as the result of merging the sources. Automatically add extend either side with null columns for any missing columns that are nullable.
Attachments
Issue Links
- is related to
-
SPARK-21316 Dataset Union output is not consistent with the column sequence
- Resolved
- relates to
-
SPARK-21043 Add unionByName API to Dataset
- Resolved