Details
Type: New Feature
Status: Triage Needed
Priority: P3
Resolution: Unresolved
Description
Add a PTransform that groups keyed inputs into batches of a desired size. Each batch will contain only elements of a single key.
It should offer the same API as its Java counterpart: https://github.com/apache/beam/blob/11a977b8b26eff2274d706541127c19dc93131a2/sdks/java/core/src/main/java/org/apache/beam/sdk/transforms/GroupIntoBatches.java
Unlike the BatchElements transform (https://github.com/apache/beam/blob/11a977b8b26eff2274d706541127c19dc93131a2/sdks/python/apache_beam/transforms/util.py#L461), GroupIntoBatches will use state to batch across bundles as well.
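To make the intended semantics concrete, here is a minimal plain-Python sketch of per-key batching that carries its buffers across bundles. It is not Beam code and not the proposed implementation: `KeyedBatcher`, `process_bundle`, and `flush` are hypothetical names, the dictionary of buffers merely stands in for per-key state, and `flush` stands in for whatever end-of-input or end-of-window trigger a real transform would use.

```python
from collections import defaultdict
from typing import Any, Dict, Hashable, Iterable, List, Tuple

Batch = Tuple[Hashable, List[Any]]


class KeyedBatcher:
    """Hypothetical stand-in for per-key state; not part of the Beam API."""

    def __init__(self, batch_size: int) -> None:
        if batch_size < 1:
            raise ValueError("batch_size must be at least 1")
        self.batch_size = batch_size
        # The buffers outlive any single bundle, mimicking per-key state.
        self._buffers: Dict[Hashable, List[Any]] = defaultdict(list)

    def process_bundle(self, bundle: Iterable[Tuple[Hashable, Any]]) -> List[Batch]:
        """Buffer (key, value) pairs and return every batch that filled up."""
        full: List[Batch] = []
        for key, value in bundle:
            buf = self._buffers[key]
            buf.append(value)
            if len(buf) == self.batch_size:
                full.append((key, list(buf)))
                buf.clear()
        return full

    def flush(self) -> List[Batch]:
        """Return the remaining partial batches and reset all buffers."""
        rest = [(key, list(buf)) for key, buf in self._buffers.items() if buf]
        self._buffers.clear()
        return rest


batcher = KeyedBatcher(batch_size=3)
# Key "a" only reaches three elements once the second bundle arrives.
first = batcher.process_bundle([("a", 1), ("b", 10), ("a", 2)])
second = batcher.process_bundle([("a", 3), ("b", 20)])
leftover = batcher.flush()
```

In this sketch the batch for key "a" is completed by an element from the second bundle, which is the cross-bundle behaviour the description asks for. A transform that only batches within a bundle would have had to emit the partial contents for "a" when the first bundle ended.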