Uploaded image for project: 'Beam'
  1. Beam
  2. BEAM-3000

No python equivalent of org.apache.beam.sdk.transforms.Sample.any(100)?

Details

    • Improvement
    • Status: Open
    • P3
    • Resolution: Unresolved
    • None
    • None
    • sdk-py-core

    Description

      Java's org.apache.beam.sdk.transforms.Sample.any will return a PCollection with bounded size (filtering strategy).
      The closest python eqiuvalent is beam.Sample.FixedSizeGlobally whover, this version uses a combiner strategy, returning a list with n elements; which does not scale if n is "bigger than what fits in memory".

      Attachments

        Activity

          People

            Unassigned Unassigned
            rodrigob Rodrigo Benenson
            Votes:
            2 Vote for this issue
            Watchers:
            8 Start watching this issue

            Dates

              Created:
              Updated: