Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-28003

spark.createDataFrame with Arrow doesn't work with pandas.NaT

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 2.3.3, 2.4.3
    • 3.0.0
    • PySpark
    • None

    Description

      import pandas as pd
      dt1 = [pd.NaT, pd.Timestamp('2019-06-11')] * 100
      pdf1 = pd.DataFrame({'time': dt1})
      
      df1 = self.spark.createDataFrame(pdf1)
      

      The example above doesn't work with arrow enabled.

      Attachments

        Issue Links

          Activity

            People

              icexelloss Li Jin
              icexelloss Li Jin
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: