Details
- Type: Improvement
- Status: Open
- Priority: P2
- Resolution: Unresolved
Description
Add an implementation for https://s.apache.org/batched-dofns to the Python SDK.
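
For readers unfamiliar with the idea, the following is a minimal, library-free sketch of the difference between element-wise and batched processing. It is not the Beam implementation and does not use the Beam API. The method name `process_batch` is taken from the sub-task titles on this issue, while `ElementwiseDouble`, `BatchedDouble`, `run_batched`, and the `batch_size` parameter are hypothetical names introduced only for illustration.

```python
from typing import Iterable, Iterator, List


class ElementwiseDouble:
    """Hypothetical element-wise function: invoked once per element."""

    def process(self, element: int) -> Iterator[int]:
        yield element * 2


class BatchedDouble:
    """Hypothetical batched function: invoked once per batch of elements."""

    def process_batch(self, batch: List[int]) -> Iterator[List[int]]:
        # One call handles the whole batch and yields an output batch.
        yield [x * 2 for x in batch]


def run_batched(fn: BatchedDouble, elements: Iterable[int],
                batch_size: int) -> List[int]:
    """Toy driver (an assumption, not how the SDK worker does it):
    buffer elements into fixed-size batches, call process_batch on each,
    and flatten the output batches back into individual elements."""
    out: List[int] = []
    buffer: List[int] = []
    for element in elements:
        buffer.append(element)
        if len(buffer) == batch_size:
            for result in fn.process_batch(buffer):
                out.extend(result)
            buffer = []
    if buffer:  # flush the final, possibly smaller, batch
        for result in fn.process_batch(buffer):
            out.extend(result)
    return out
```

In this sketch the batched path produces the same elements as the element-wise path; the only difference is how many times user code is invoked.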
Issue Links
- blocks: BEAM-14044 Hook In Batching DoFn Apis to RunInference (Open)
- is related to: BEAM-14540 Native implementation for serialized Rows to/from Arrow (Open)
- links to

| # | Sub-task | Status | Assignee |
|---|----------|--------|----------|
| 1 | Add API and construction-time validation for process_batch (with assumptions) | Resolved | Brian Hulette |
| 2 | Add @DoFn.yields_batches and @DoFn.yields_elements decorators to override defaults | Open | Brian Hulette |
| 3 | MVP for SDK worker changes to support process_batch | Resolved | Brian Hulette |
| 4 | More verbose errors when BatchConverter fails to match | Open | Unassigned |
| 5 | Consider providing a dynamic API for declaring batch input type | Open | Unassigned |
| 6 | Add a TimestampedBatch analogue for TimestampedValue | Open | Unassigned |
| 7 | batch-consuming DoFns should estimate byte size | Open | Brian Hulette |
| 8 | Support per-key DoFn params in process_batch | Open | Unassigned |