Details
- Type: Wish
- Status: Resolved
- Priority: P3
- Resolution: Duplicate
Description
I've repeatedly seen the following pattern: a user wants to join a PCollection against a slowly changing external dataset, e.g. a file on GCS or a Bigtable table.
Side inputs come to mind, but the current side input mechanisms don't support periodically reloading the side input.
The best hacky solution I came up with for one use case is documented here: http://stackoverflow.com/questions/41254028/can-dataflow-sideinput-be-updated-per-window-by-reading-a-gcs-bucket/41271159#41271159. We need to do better than this.
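To make the desired behavior concrete, here is a minimal plain-Python sketch of "join each element against a lookup that is reloaded once it gets too old". It is not a Beam API and not necessarily the approach taken in the linked answer. The names `RefreshingLookup`, `join_element`, `loader`, and `max_age_s` are hypothetical, introduced only for illustration. In a real pipeline, `loader` would stand in for whatever reads the GCS file or Bigtable table.

```python
import time
from typing import Any, Callable, Dict, Optional, Tuple


class RefreshingLookup:
    """Caches an external dataset and reloads it once the cached copy is too old.

    Hypothetical helper for illustration only; not part of Beam.
    """

    def __init__(
        self,
        loader: Callable[[], Dict[Any, Any]],
        max_age_s: float,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._loader = loader        # assumed: reads the external dataset
        self._max_age_s = max_age_s  # how stale the cached copy may become
        self._clock = clock          # injectable so the sketch can be tested
        self._data: Optional[Dict[Any, Any]] = None
        self._loaded_at: Optional[float] = None

    def get(self) -> Dict[Any, Any]:
        now = self._clock()
        # Reload on first use, or when the cached copy has reached max_age_s.
        if self._loaded_at is None or now - self._loaded_at >= self._max_age_s:
            self._data = self._loader()
            self._loaded_at = now
        return self._data


def join_element(
    element: Tuple[Any, Any], lookup: RefreshingLookup
) -> Tuple[Any, Any, Any]:
    """Joins one (key, value) element against the current lookup contents."""
    key, value = element
    # Keys missing from the external dataset join to None.
    return key, value, lookup.get().get(key)
```

A per-worker cache like this gives no consistency guarantee across workers: each one reloads on its own schedule, so two workers may briefly join against different versions of the dataset. That gap is part of why first-class support would be preferable.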
Issue Links
- duplicates: BEAM-9650 Add consistent slowly changing side inputs support (Triage Needed)