Uploaded image for project: 'Beam'
  1. Beam
  2. BEAM-10991

Timers don't release watermark holds in dataflow on 2.24

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: P1
    • Resolution: Unresolved
    • Affects Version/s: 2.24.0
    • Fix Version/s: 2.25.0
    • Component/s: runner-dataflow
    • Labels:
      None

      Description

      We have multiple streaming pipelines (using state + timers) that, after upgrading to 2.24, exhibited very strange watermark behavior.  The watermark on some stateful DoFns would advance to the end of the first window, and then get stuck there forever, even preventing the job from draining.

      I was able to track the problem down to https://github.com/apache/beam/commit/88acc5267f759d81e9836a9db17b9e0ee521c785.  After revering it, the behavior went back to normal.  It looks like its possible in that commit that watermark holds for some timers aren't  being cleared.

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                kenn Kenneth Knowles
                Reporter:
                SteveNiemitz Steve Niemitz
              • Votes:
                1 Vote for this issue
                Watchers:
                7 Start watching this issue

                Dates

                • Created:
                  Updated:

                  Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 4h
                  4h