[FLINK-22913] Support Python UDF chaining in Python DataStream API - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Closed
Priority: Major
Resolution: Done
Affects Version/s: None
Fix Version/s: 1.14.0
Component/s: API / Python
Labels:
None

Release Note:

Hide
The job graph of Python DataStream API jobs may be different from before as the Python functions will be chained as much as possible to optimize the performance. You could disable Python functions chaining by setting 'python.operator-chaining.enabled' as 'false' explicitly.

Show
The job graph of Python DataStream API jobs may be different from before as the Python functions will be chained as much as possible to optimize the performance. You could disable Python functions chaining by setting 'python.operator-chaining.enabled' as 'false' explicitly.

Description

Currently, for the following job:

ds = ..
ds.map(map_func1)
    .map(map_func2)

The Python function `map_func1` and `map_func2` will runs in separate Python workers and the result of `map_func1` will be transferred to JVM and then transferred to `map_func2` which may resides in another Python worker. This introduces redundant communication and serialization/deserialization overhead.

Attachments

Sub-Tasks

1.	Remove ProcessFunctionOperation	Closed	Dian Fu
2.	Support to decode a single record for the Python coder	Closed	Dian Fu
3.	Separate data and timer connection into different channels for Python DataStream API operators	Closed	Dian Fu
4.	Remove Python operators PythonFlatMapOperator/PythonMapOperator/PythonPartitionCustomOperator and use PythonProcessOperator instead	Closed	Dian Fu
5.	Introduce PythonCoProcessOperator and remove PythonCoFlatMapOperator & PythonCoMapOperator	Closed	Dian Fu
6.	Support to chain the Python DataStream operators as much as possible	Closed	Dian Fu
7.	Remove PythonTimestampsAndWatermarksOperator	Closed	Dian Fu
8.	Add documentation about Python DataStream API chaining optimization	Closed	Dian Fu
9.	Testing Python DataStream API chaining functionality	Closed	Dian Fu

Activity

People

Assignee:: Dian Fu

Reporter:: Dian Fu

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Dates

Created:: 08/Jun/21 06:04

Updated:: 13/Sep/21 12:02

Resolved:: 24/Aug/21 09:29