  Beam / BEAM-7326

Document that Beam BigQuery IO expects users to pass base64-encoded bytes, and BQ IO serves base64-encoded bytes to the user.

Details

    Description

      BYTES is one of the data types supported by Google Cloud BigQuery and by the Apache Beam BigQuery IO connector.

      The current implementation of the BigQuery connector in the Java and Python SDKs expects users to base64-encode bytes before passing them to BigQuery IO, and it returns base64-encoded bytes when reading; see the discussion on the dev list [1].

      This behavior needs to be reflected in the public documentation; see [2-4].
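
      For illustration, the expected handling of a BYTES field can be sketched in plain Python as follows. This is a minimal sketch using only the standard library; the helper names `encode_bytes_fields` and `decode_bytes_fields`, and the sample row with its `payload` field, are hypothetical and not part of the Beam API.

```python
import base64


def encode_bytes_fields(row, bytes_fields):
    """Return a copy of `row` with the named BYTES fields base64-encoded.

    Hypothetical helper: intended to run on each row before it is handed
    to BigQuery IO for writing.
    """
    encoded = dict(row)
    for name in bytes_fields:
        if encoded.get(name) is not None:
            # b64encode returns bytes; decode to an ASCII str for the row dict.
            encoded[name] = base64.b64encode(encoded[name]).decode("ascii")
    return encoded


def decode_bytes_fields(row, bytes_fields):
    """Return a copy of `row` with the named BYTES fields base64-decoded.

    Hypothetical helper: intended to run on each row after it is read
    back from BigQuery IO.
    """
    decoded = dict(row)
    for name in bytes_fields:
        if decoded.get(name) is not None:
            decoded[name] = base64.b64decode(decoded[name])
    return decoded


# Sample row (assumed shape): one INTEGER field and one BYTES field.
raw_row = {"id": 1, "payload": b"\x00\xffbeam"}
to_write = encode_bytes_fields(raw_row, ["payload"])
round_tripped = decode_bytes_fields(to_write, ["payload"])
```

      In a pipeline, helpers like these would presumably be applied in a mapping step just before the write and just after the read, so that the rest of the pipeline can work with raw bytes.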

      cc: juta chamikara pabloem

      cc: lostluck kedin FYI, and to advise whether similar action is needed for the Go SDK and/or Beam SQL.

      [1] https://lists.apache.org/thread.html/f35c836887014e059527ed1a806e730321e2f9726164a3030575f455@%3Cdev.beam.apache.org%3E
      [2] https://beam.apache.org/documentation/io/built-in/google-bigquery/
      [3] https://beam.apache.org/releases/pydoc/2.12.0/apache_beam.io.gcp.bigquery.html
      [4] https://beam.apache.org/releases/javadoc/2.12.0/org/apache/beam/sdk/io/gcp/bigquery/BigQueryIO.html

            People

              Juta Juta Staes
              tvalentyn Valentyn Tymofieiev
              Votes: 0
              Watchers: 3


                Time Tracking

                  Original Estimate: Not Specified
                  Remaining Estimate: 0h
                  Time Spent: 3h 50m