Details
-
Bug
-
Status: Triage Needed
-
P2
-
Resolution: Fixed
-
None
-
None
Description
By default, the Python SDK adds a timer output timestamp equal to the current timestamp of an element. This is problematic because
1. We hold back the output watermark on the current element's timestamp for every timer
2. It doesn't match the behavior in the Java SDK which defaults to using the fire timestamp as the timer output timestamp (and adds a hold on it)
3. There is no way for the user to influence this behavior because there is no user-facing API
We should use the fire timestamp as the default output timestamp.
Attachments
Issue Links
- relates to
-
BEAM-10691 FlinkRunner: pipeline slows down due to expensive output timestamp queue
- Resolved
- links to