Uploaded image for project: 'Beam'
  1. Beam
  2. BEAM-8988

apache_beam.io.gcp.bigquery_read_it_test failing with: NotImplementedError: BigQuery source must be split before being read

Details

    • Bug
    • Status: Resolved
    • P1
    • Resolution: Fixed
    • None
    • 2.19.0
    • io-py-gcp
    • None
    • The Big 2020 Sprint

    Description

      Sample failure: https://builds.apache.org/job/beam_PostCommit_Python37_PR/58/

      Triggered by https://github.com/apache/beam/pull/9772.

      Stacktrace:

      Pipeline BeamApp-jenkins-1217231928-2108ede4_7476773b-6b06-4536-a0d5-c5fafb6c0935 failed in state FAILED: java.lang.RuntimeException: Error received from SDK harness for instruction 96: Traceback (most recent call last):
        File "/home/jenkins/jenkins-slave/workspace/beam_PostCommit_Python37_PR/src/sdks/python/apache_beam/runners/common.py", line 879, in process
          return self.do_fn_invoker.invoke_process(windowed_value)
        File "/home/jenkins/jenkins-slave/workspace/beam_PostCommit_Python37_PR/src/sdks/python/apache_beam/runners/common.py", line 669, in invoke_process
          windowed_value, additional_args, additional_kwargs, output_processor)
        File "/home/jenkins/jenkins-slave/workspace/beam_PostCommit_Python37_PR/src/sdks/python/apache_beam/runners/common.py", line 747, in _invoke_process_per_window
          windowed_value, self.process_method(*args_for_process))
        File "/home/jenkins/jenkins-slave/workspace/beam_PostCommit_Python37_PR/src/sdks/python/apache_beam/runners/common.py", line 998, in process_outputs
          for result in results:
        File "/home/jenkins/jenkins-slave/workspace/beam_PostCommit_Python37_PR/src/sdks/python/apache_beam/runners/worker/bundle_processor.py", line 1256, in process
          yield element, self.restriction_provider.initial_restriction(element)
        File "/home/jenkins/jenkins-slave/workspace/beam_PostCommit_Python37_PR/src/sdks/python/apache_beam/io/iobase.py", line 1518, in initial_restriction
          range_tracker = self._source.get_range_tracker(None, None)
        File "/home/jenkins/jenkins-slave/workspace/beam_PostCommit_Python37_PR/src/sdks/python/apache_beam/io/gcp/bigquery.py", line 652, in get_range_tracker
          raise NotImplementedError('BigQuery source must be split before being read')
      NotImplementedError: BigQuery source must be split before being read
      

      Attachments

        Issue Links

          Activity

            People

              kamilwu Kamil Wasilewski
              tvalentyn Valentyn Tymofieiev
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 3h 10m
                  3h 10m