Uploaded image for project: 'Beam'
  1. Beam
  2. BEAM-3506

JdbcIO: Support writing iterables (i.e. collections) of rows instead of only single rows

Details

    Description

      The current JdbcIO write interface expects a PCollection<T> where T is the row to be written. Each instance of T is then added to a batch and written to the database. The user has little control over how many rows will be added to one batch. If JdbcIO would also support writing a PCollection<Iterable<T>> the user would have more control over the number of rows in one batch. Especially when writing to cloud databases, such as Google Cloud Spanner, the batching of multiple rows together is important for performance.

      I already have a solution locally and I will submit a pull request.

      Attachments

        Issue Links

          Activity

            People

              jbonofre Jean-Baptiste Onofré
              koloite Knut Olav Loite
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - 4h
                  4h
                  Remaining:
                  Time Spent - 0.5h Remaining Estimate - 3.5h
                  3.5h
                  Logged:
                  Time Spent - 0.5h Remaining Estimate - 3.5h
                  0.5h