Beam / BEAM-2864

Support backfill deduplication in BigQueryIO.write()

Details

    • Type: Bug
    • Status: Open
    • Priority: P3
    • Resolution: Unresolved
    • Component/s: io-java-gcp

    Description

      See https://github.com/GoogleCloudPlatform/DataflowJavaSDK/issues/603, which was motivated by the Stack Overflow question https://stackoverflow.com/questions/46076914/apache-beam-update-bigquery-table-row-with-bigqueryio

      Perhaps one way to do this is to make BigQueryIO.write() return a PValue that can be sequenced with other steps, and to implement a BigQuery.update() transform that executes a single DML statement (or a small collection of them, since DML statements in BigQuery are a scarce resource). Users could then sandwich the two together if they want to.
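
      To make the "single DML statement" idea concrete, here is a minimal sketch of what such a statement could look like when the pipeline first writes to a staging table and then folds it into the target. This is an illustration only: the class UpsertDmlSketch, the method mergeStatement, and all table and column names are hypothetical, the proposed update transform does not exist, and the use of MERGE is an assumption rather than something this issue specifies.

```java
import java.util.Arrays;
import java.util.List;
import java.util.stream.Collectors;

/** Hypothetical helper; not part of Beam or the BigQuery client library. */
class UpsertDmlSketch {

  /**
   * Builds one BigQuery standard SQL MERGE statement that folds a staging
   * table into a target table, matching rows on a single key column.
   * Names are interpolated as-is, so they are assumed to be trusted.
   */
  static String mergeStatement(
      String target, String staging, String keyColumn, List<String> valueColumns) {
    // "col = S.col" for each non-key column, applied when the key already exists.
    String assignments = valueColumns.stream()
        .map(c -> c + " = S." + c)
        .collect(Collectors.joining(", "));
    return "MERGE `" + target + "` T\n"
        + "USING `" + staging + "` S\n"
        + "ON T." + keyColumn + " = S." + keyColumn + "\n"
        + "WHEN MATCHED THEN UPDATE SET " + assignments + "\n"
        + "WHEN NOT MATCHED THEN INSERT ROW";
  }

  public static void main(String[] args) {
    // Hypothetical dataset, table, and column names.
    System.out.println(mergeStatement(
        "my_dataset.events", "my_dataset.events_staging",
        "event_id", Arrays.asList("payload", "updated_at")));
  }
}
```

      In the sandwich described above, the write step would populate the staging table and the update step would run this one statement after the write has finished. The staging table is assumed to hold at most one row per key, since MERGE fails when several source rows match the same target row.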

          People

            Assignee: Unassigned
            Reporter: Eugene Kirpichov (jkff)
            Votes: 1
            Watchers: 4

            Dates

              Created:
              Updated: