  Beam / BEAM-13268

Reduce latency by parallelizing BQ inserts when flushing due to row limit

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: P2
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 2.35.0
    • Component/s: io-java-gcp
    • Labels: None

    Description

      InsertBatchedElements consumes the output of GroupIntoBatches, which by default flushes a batch after 500 elements to respect the maximum number of rows per streaming insert request to BQ.

      However, InsertBatchedElements flushes rows synchronously, so the write latencies of successive batches accumulate. It could instead initiate writes in ProcessElement and block on write completion in FinishBundle. The number of concurrent writes could be capped by a configurable limit, if desired, to bound memory usage.
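
The proposal above can be sketched in plain Java, without any Beam or BigQuery dependency. This is only an illustration under stated assumptions, not the change that was merged: the class `ParallelFlushSketch`, its `maxParallelInserts` parameter, and the `insertAll` stand-in (which just sleeps to simulate a streaming insert) are all hypothetical names. The methods mirror the DoFn lifecycle by name only.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Semaphore;
import java.util.concurrent.atomic.AtomicInteger;

/** Hypothetical stand-in for a DoFn; this is NOT Beam's InsertBatchedElements. */
public class ParallelFlushSketch {
    private final ExecutorService pool;
    // Caps the number of in-flight writes so buffered batches cannot grow without bound.
    private final Semaphore inFlight;
    private final List<CompletableFuture<Void>> pending = new ArrayList<>();
    private final AtomicInteger rowsWritten = new AtomicInteger();

    public ParallelFlushSketch(int maxParallelInserts) {
        this.pool = Executors.newFixedThreadPool(maxParallelInserts);
        this.inFlight = new Semaphore(maxParallelInserts);
    }

    /** Assumed placeholder for one streaming insert request; sleeps to simulate latency. */
    private void insertAll(List<String> rows) {
        try {
            Thread.sleep(50);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new RuntimeException(e);
        }
        rowsWritten.addAndGet(rows.size());
    }

    /** Starts the write and returns without waiting for it to complete. */
    public void processElement(List<String> batch) {
        // Blocks only when maxParallelInserts writes are already in flight.
        inFlight.acquireUninterruptibly();
        CompletableFuture<Void> write =
            CompletableFuture.runAsync(() -> insertAll(batch), pool)
                // Release the permit whether the write succeeded or failed.
                .whenComplete((result, error) -> inFlight.release());
        pending.add(write);
    }

    /** Blocks until every write started in this bundle has finished; rethrows failures. */
    public void finishBundle() {
        try {
            CompletableFuture.allOf(pending.toArray(new CompletableFuture[0])).join();
        } finally {
            pending.clear();
        }
    }

    public int rowsWritten() {
        return rowsWritten.get();
    }

    public void teardown() {
        pool.shutdown();
    }
}
```

With a limit of 4 and eight batches, the simulated writes overlap in groups of up to four instead of running one after another, and `finishBundle` surfaces any failed write as a `CompletionException` before the bundle is committed.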

            People

              Assignee: Sam Whittle (scwhittle)
              Reporter: Sam Whittle (scwhittle)
              Votes: 0
              Watchers: 1

                Time Tracking

                  Estimated: Not Specified
                  Remaining: 0h
                  Logged: 1h