Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-42883 Implement Pandas API Missing Parameters
  3. SPARK-42194

Allow `columns` parameter when creating DataFrame with Series.

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 3.5.0
    • 3.4.1
    • Pandas API on Spark
    • None

    Description

      pandas API on Spark doesn't allow creating DataFrame with Series by specifying the `columns` parameter as below:

      >>> ps.DataFrame(psser, columns=["labels"])
      Traceback (most recent call last):
        File "<stdin>", line 1, in <module>
        File ".../spark/python/pyspark/pandas/frame.py", line 539, in __init__
          assert columns is None
      AssertionError 

      We should make it available.

      Attachments

        Activity

          People

            itholic Haejoon Lee
            itholic Haejoon Lee
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: