Details
-
Umbrella
-
Status: Resolved
-
Major
-
Resolution: Done
-
3.3.0
Description
See https://koalas.readthedocs.io/en/latest/user_guide/typehints.html.
pandas API on Spark currently there's no way to specify the index type and name in the output when you apply an arbitrary function, which forces to create the default index:
>>> def transform(pdf) -> pd.DataFrame["id": int, "A": int]: ... pdf['A'] = pdf.id + 1 ... return pdf ... >>> ps.range(5).koalas.apply_batch(transform)
id A 0 0 1 1 1 2 2 2 3 3 3 4 4 4 5
We should have a way to specify the index.