Details
-
New Feature
-
Status: Resolved
-
P2
-
Resolution: Implemented
-
None
Description
- Builds off of the work accomplished under
BEAM-13685 - Modified interactive_environment.py to support caching to a bucket on GCS for batch processing pipelines
- If a specified bucket does not exist, the pipeline will terminate and return an error specifying that the bucket does not exist
- Added cleanup() functionality to cache_manager.py, to enable the FileBasedCacheManager class to automatically delete cached values on GCS when a bucket path is specified
- Added docstring to interactive_beam.py with an example of GCS path assignment
- Cached files on GCS will be stored under a directory represented by the value of id(pipeline).
- Example cached path: gs://my-gcs-bucket/cache/dir/id(pipeline)
Attachments
Issue Links
- links to