Uploaded image for project: 'Phoenix'
  1. Phoenix
  2. PHOENIX-6667

Spark3 connector requires that all columns are specified when writing

    XMLWordPrintableJSON

Details

    Description

      For Spark 2, it was possible to omit some columns from the dataframe, the same way it is not mandatory to specify all columns when upserting via SQL.

      Spark3 has added new checks, which require that EVERY sql column is specifed in the DataFrame.

      Consequently, when using the current API, writing will fail unless you specify all columns.

      This is a loss of functionality WRT Phoenix (and other SQL datastores) compared to Spark2.

      I don't think that we can do anything from the Phoenix side, just documenting the regression here.

      Maybe future Spark versions will make this configurable.

      Attachments

        Issue Links

          Activity

            People

              attilapiros Attila Zsolt Piros
              stoty Istvan Toth
              Votes:
              1 Vote for this issue
              Watchers:
              6 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: