Uploaded image for project: 'Beam'
  1. Beam
  2. BEAM-11472

taxi_dataframe example broken

Details

    • Bug
    • Status: Resolved
    • P1
    • Resolution: Fixed
    • None
    • 2.27.0
    • sdk-py-core
    • None

    Description

      I just tried running the taxi_dataframe example at HEAD on Dataflow and found that it fails with the following error:

      INFO:apache_beam.runners.dataflow.dataflow_runner:2020-12-15T16:27:55.653Z: JOB_MESSAGE_ERROR: Traceback (most recent call last):
        File "apache_beam/runners/common.py", line 1214, in apache_beam.runners.common.DoFnRunner.process
        File "apache_beam/runners/common.py", line 721, in apache_beam.runners.common.PerWindowInvoker.invoke_process
        File "apache_beam/runners/common.py", line 753, in apache_beam.runners.common.PerWindowInvoker._should_process_window_for_sdf
        File "apache_beam/runners/common.py", line 548, in apache_beam.runners.common.DoFnInvoker.invoke_create_tracker
        File "/usr/local/lib/python3.8/site-packages/apache_beam/dataframe/io.py", line 313, in create_tracker
          tracker = beam.io.restriction_trackers.OffsetRestrictionTracker(restriction)
        File "/usr/local/lib/python3.8/site-packages/apache_beam/io/restriction_trackers.py", line 91, in __init__
          assert isinstance(offset_range, OffsetRange)
      AssertionError 
      

      https://github.com/apache/beam/pull/13443 Seems like the most likely culprit, and I confirmed that running with Beam before that commit (at bd825f574e342cfa83fb09767c7d5a19a3accc55) does not fail.

      Attachments

        Issue Links

          Activity

            People

              robertwb Robert Bradshaw
              bhulette Brian Hulette
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 40m
                  40m