Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-50146

Consolidate configurable schema verification of createDataFrame

    XMLWordPrintableJSON

Details

    Description

      verifySchema parameter of createDataFrame decides whether to verify data types of every row against schema.

      In Spark Classic

      Now it only takes effect for with createDataFrame with

      • regular Python instances

      We propose to make it work with createDataFrame with

      • pyarrow.Table
      • pandas.DataFrame with Arrow optimization
      • pandas.DataFrame without Arrow optimization

      In Spark Connect

      Now it does not take effect.

      We propose to make it work with all inputs.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              XinrongM Xinrong Meng
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated: