Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-34629 Python type hints improvement
  3. SPARK-34544

pyspark toPandas() should return pd.DataFrame

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 3.1.1
    • 3.3.0
    • PySpark
    • None

    Description

      Right now toPandas() returns DataFrameLike, which is an incomplete "view" of pandas DataFrame. Which leads to cases like mypy reporting that certain pandas methods are not present in DataFrameLike, even tho those methods are valid methods on pandas DataFrame, which is the actual type of the object. This requires type ignore comments or asserts.

      Attachments

        Issue Links

          Activity

            People

              zero323 Maciej Szymkiewicz
              ravwojdyla Rafal Wojdyla
              Votes:
              1 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: