Uploaded image for project: 'Beam'
  1. Beam
  2. BEAM-8292

Add a Reshuffle PTransform preventing fusion of the surrounding transforms

Details

    • New Feature
    • Status: Resolved
    • P3
    • Resolution: Fixed
    • None
    • Not applicable
    • sdk-go
    • None

    Description

      Reshuffle is a PTransform that takes a PCollection<A> and shuffles the data to help increase parallelism.
      Reshuffle adds a temporary random key to each element, performs a
      GroupByKey, and finally removes the temporary key.

      Attachments

        Activity

          People

            lostluck Robert Burke
            johnpatoch69 John Patoch
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Estimated:
                Original Estimate - Not Specified
                Not Specified
                Remaining:
                Remaining Estimate - 0h
                0h
                Logged:
                Time Spent - 4h
                4h