Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-34849 SPIP: Support pandas API layer on PySpark
  3. SPARK-35605

Move to_pandas_on_spark to the Spark DataFrame.

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 3.2.0
    • 3.2.0
    • PySpark
    • None

    Description

      Inspired by https://github.com/apache/spark/pull/32729#discussion_r643591322,

      As Koalas is ported into PySpark, we don't need auto patch anymore (https://github.com/apache/spark/blob/master/python/pyspark/pandas/__init__.py#L136-L150)

      Thus, we should to_pandas_on_spark and to_koalas(deprecated) to the PySpark DataFrame and add related tests.

      Attachments

        Activity

          People

            itholic Haejoon Lee
            itholic Haejoon Lee
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: