Details
-
Improvement
-
Status: Open
-
P3
-
Resolution: Unresolved
-
None
-
None
Description
Java's org.apache.beam.sdk.transforms.Sample.any will return a PCollection with bounded size (filtering strategy).
The closest python eqiuvalent is beam.Sample.FixedSizeGlobally whover, this version uses a combiner strategy, returning a list with n elements; which does not scale if n is "bigger than what fits in memory".