Uploaded image for project: 'Beam'
  1. Beam
  2. BEAM-3042

Add tracking of bytes read / time spent when reading side inputs

Details

    • Bug
    • Status: Resolved
    • P2
    • Resolution: Fixed
    • None
    • 2.6.0
    • sdk-py-core
    • None

    Description

      It is difficult for Dataflow users to understand how modifying a pipeline or data set can affect how much inter-transform IO is used in their job. The intent of this feature request is to help users understand how side inputs behave when they are consumed.

      This will allow users to understand how much time and how much data their pipeline uses to read/write to inter-transform IO. Users will also be able to modify their pipelines and understand how their changes affect these IO metrics.

      For further information, please review the internal Google doc go/insights-transform-io-design-doc.

      Attachments

        Activity

          People

            pabloem Pablo Estrada
            pabloem Pablo Estrada
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Estimated:
                Original Estimate - Not Specified
                Not Specified
                Remaining:
                Remaining Estimate - 0h
                0h
                Logged:
                Time Spent - 7h 10m
                7h 10m