Details
Type: Bug
Status: Open
Priority: Minor
Resolution: Unresolved
Affects Version/s: 1.3.0, 1.4.0
Fix Version/s: None
Component/s: None
Labels: None
Description
The ListGCSBucket processor emits duplicate flowfiles under circumstances that have not yet been identified. Dan Young reported the issue on the dev list (see the thread "ListGCSBucket and duplicates").
I was able to reproduce the issue by writing a constant stream of objects to a GCS bucket while ListGCSBucket listed that bucket on a 30-second schedule, with its output routed to DetectDuplicate configured with a Cache Entry Identifier of ${gcs.key}.
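For anyone trying to reproduce this, the writer side could look roughly like the sketch below. It is not the script used for the report above: the object naming scheme, the prefix "dup-test", the write interval, and all function names are assumptions made for illustration. The upload step is passed in as a callable so the loop can be tried without GCS; gcs_uploader shows one way to back it with the google-cloud-storage Python client, assuming that library is installed and default credentials are configured.

```python
import itertools
import time
from typing import Callable, Iterator, List


def object_names(prefix: str = "dup-test") -> Iterator[str]:
    """Yield unique, ordered object names: dup-test/00000000, dup-test/00000001, ..."""
    for i in itertools.count():
        yield f"{prefix}/{i:08d}"


def write_stream(
    upload: Callable[[str, bytes], None],
    count: int,
    interval_s: float = 0.5,
    prefix: str = "dup-test",
) -> List[str]:
    """Upload `count` small objects, pausing `interval_s` seconds between writes.

    Returns the names written so they can be compared with what the
    listing processor later emits.
    """
    written: List[str] = []
    for name in itertools.islice(object_names(prefix), count):
        upload(name, name.encode("utf-8"))  # payload is just the name itself
        written.append(name)
        time.sleep(interval_s)
    return written


def gcs_uploader(bucket_name: str) -> Callable[[str, bytes], None]:
    """Build an upload callable backed by the google-cloud-storage client.

    Imported lazily so the rest of the sketch runs without the library.
    """
    from google.cloud import storage

    bucket = storage.Client().bucket(bucket_name)

    def upload(name: str, payload: bytes) -> None:
        bucket.blob(name).upload_from_string(payload)

    return upload
```

Something like write_stream(gcs_uploader("my-test-bucket"), count=1000) would keep objects arriving across many 30-second listing cycles; the bucket name here is a placeholder.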
The same DetectDuplicate processor, placed immediately after ListGCSBucket, also serves as an effective workaround.
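To make the workaround concrete, here is a minimal sketch of the filtering idea: remember every key already seen and divert repeats. It only mimics the behaviour in plain Python and is not NiFi code; the real DetectDuplicate processor keeps its entries in a cache service, not an in-process dictionary. The names SeenKeys and split_listing are hypothetical, and the optional age-off parameter is an assumption loosely modelled on the processor's Age Off Duration setting.

```python
import time
from typing import Callable, Dict, Iterable, List, Optional, Tuple


class SeenKeys:
    """In-memory stand-in for a duplicate-detection cache keyed on gcs.key."""

    def __init__(
        self,
        age_off_s: Optional[float] = None,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._age_off_s = age_off_s  # None means entries never expire
        self._clock = clock
        self._seen: Dict[str, float] = {}  # key -> time it was first recorded

    def is_duplicate(self, key: str) -> bool:
        """Return True if `key` was recorded earlier and has not aged off."""
        now = self._clock()
        first_seen = self._seen.get(key)
        if first_seen is not None and (
            self._age_off_s is None or now - first_seen < self._age_off_s
        ):
            return True
        # New key, or an expired entry: record it and treat it as unique.
        self._seen[key] = now
        return False


def split_listing(
    keys: Iterable[str], cache: SeenKeys
) -> Tuple[List[str], List[str]]:
    """Partition one listing into (non-duplicate, duplicate) keys."""
    unique: List[str] = []
    duplicates: List[str] = []
    for key in keys:
        (duplicates if cache.is_duplicate(key) else unique).append(key)
    return unique, duplicates
```

Because the cache persists across listings, a key re-emitted by a later listing cycle lands in the duplicate list even if the listing that contains it has no internal repeats.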