Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Duplicate
-
3.3.0
-
None
-
None
Description
New APIs in Structured Streaming tend to get added to Java/Scala first. This creates a situation where the Python API have fallen behind. For example map/flatMapGroupsWithState is not supported in the Pyspark. We need Pyspark API to catch up with the Java/Scala APIs and, where necessary, provide tighter integrations with native python data processing frameworks such as Pandas.
Attachments
Issue Links
- Is contained by
-
SPARK-40025 Project Lightspeed: Faster and Simpler Stream Processing with Apache Spark
- Open
- is duplicated by
-
SPARK-40431 Introduce "Arbitrary Stateful Processing" in Structured Streaming with Python
- Open