Uploaded image for project: 'Apache NiFi'
  1. Apache NiFi
  2. NIFI-4533

ListGCSBucket Returns Duplicate FlowFiles

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Minor
    • Resolution: Unresolved
    • 1.3.0, 1.4.0
    • None
    • None
    • None

    Description

      The ListGCSBucket processor returns duplicate flowfiles under some unknown circumstances. Dan Young reported the issue to the dev list (see ListGCSBucket and duplicates).

      I was able to reproduce this issue by writing a constant stream of objects to a GCS bucket, while running ListGCBucket on a 30-second schedule reading the bucket and DetectDuplicate with a Cache Entry Identifier of ${gcs.key}.

      Using a DetectDuplicate processor immediately following ListGCSBucket is also an effective workaround.

      Attachments

        Activity

          People

            Unassigned Unassigned
            jameswing James Wing
            Votes:
            1 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated: