Spark / SPARK-19615

Provide Dataset union convenience for divergent schema


Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Incomplete
    • Affects Version/s: 2.1.0
    • Fix Version/s: None
    • Component/s: SQL

    Description

      Creating a union DataFrame over two sources that have different schema definitions is surprisingly complex. Provide a version of the union method that infers a target schema by merging the schemas of the sources, and automatically extends either side with null columns for any missing columns that are nullable.
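
      The requested behavior can be approximated by hand today. The following is only a minimal PySpark sketch of the idea, not an existing Spark method: the names merge_fields and union_divergent are hypothetical, and the sketch assumes top-level columns only, identical types for columns that share a name, and column names without dots. It pads every missing column with nulls and does not check nullability, which a real implementation of this request would need to do.

```python
from typing import List, Tuple

# (column name, Spark SQL type string such as "int"), the shape of DataFrame.dtypes
Field = Tuple[str, str]


def merge_fields(left: List[Field], right: List[Field]) -> List[Field]:
    """Hypothetical helper: merge two field lists into one target schema.

    Left-side columns keep their order; columns found only on the right
    are appended. A shared column name with different types is an error.
    """
    merged = list(left)
    seen = dict(left)
    for name, dtype in right:
        if name not in seen:
            merged.append((name, dtype))
            seen[name] = dtype
        elif seen[name] != dtype:
            raise ValueError(
                f"column {name!r} has conflicting types: {seen[name]} vs {dtype}"
            )
    return merged


def union_divergent(left_df, right_df):
    """Hypothetical helper: union two DataFrames with different columns.

    Each side is first projected onto the merged schema, with typed null
    literals standing in for the columns it lacks.
    """
    # Imported lazily so merge_fields can be used without a Spark installation.
    from pyspark.sql import functions as F

    target = merge_fields(left_df.dtypes, right_df.dtypes)

    def conform(df):
        present = set(df.columns)
        return df.select([
            F.col(name) if name in present
            else F.lit(None).cast(dtype).alias(name)
            for name, dtype in target
        ])

    # union() matches columns by position, so both sides must share one order.
    return conform(left_df).union(conform(right_df))
```

      Given one DataFrame with columns (id, name) and another with (id, age), union_divergent would return rows with columns (id, name, age), where age is null for rows from the first source and name is null for rows from the second. In Spark 3.1 and later, unionByName with allowMissingColumns set to true covers a similar case out of the box.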

            People

              Assignee: Unassigned
              Reporter: Nick Dimiduk (ndimiduk)
              Votes: 0
              Watchers: 4

              Dates

                Created:
                Updated:
                Resolved: