Uploaded image for project: 'Beam'
  1. Beam
  2. BEAM-10984

Dataflow's UnboundedReaderIterator should support overriding default bundle size targets

    XMLWordPrintableJSON

    Details

    • Type: Improvement
    • Status: Triage Needed
    • Priority: P3
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: runner-dataflow
    • Labels:
      None

      Description

      Currently, the UnboundedReaderIterator will read until 10k elements have been read or 10s have passed.  This works for most pipelines, but is insufficient for pipelines that either require very high throughput or require low latency.  We should make these values controllable via a PipelineOption, probably under DataflowPipelineDebugOptions.

      The constants are defined here: https://github.com/apache/beam/blob/master/runners/google-cloud-dataflow-java/worker/src/main/java/org/apache/beam/runners/dataflow/worker/WorkerCustomSources.java#L757

        Attachments

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              millsd@google.com Daniel Mills
            • Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

              • Created:
                Updated: