Details
-
Improvement
-
Status: Resolved
-
Critical
-
Resolution: Done
-
3.1.2
-
None
Description
We added PySpark type hints at SPARK-32681
However, looks like there are still many missing APIs to type. I maintain a project called [Koalas](https://github.com/databricks/koalas), and I found these errors https://gist.github.com/HyukjinKwon/9faabc5f2680b56007d71ef7cf0ad400
For example, pyspark._version_ and pyspark.sql.Column.contains are missing in the type hints.
I believe this is the same case to other projects that enables mypy in their project (presumably also given SPARK-34544).
This umbrella JIRA targets to identify such cases and improve Python type hints in PySpark.
Attachments
1.
|
Add type hints of pyspark.__version__ and pyspark.sql.Column.contains | Resolved | Danny Meijer | |
2.
|
pyspark toPandas() should return pd.DataFrame | Resolved | Maciej Szymkiewicz | |
3.
|
Add type hints of pyspark.sql.column.* and pyspark.sql.types.* | Resolved | Unassigned | |
4.
|
Improve type hints on pyspark.sql.* | Resolved | Yikun Jiang |