  1. Apache NiFi
  2. NIFI-2925

FlowFiles that are swapped out are never released from Content Repository

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Blocker
    • Resolution: Fixed
    • Affects Version/s: 1.0.0
    • Fix Version/s: 1.1.0
    • Component/s: Core Framework
    • Labels: None

    Description

      To reproduce this, I created a simple flow: GenerateFlowFile (1 KB file size) with success going to 2 different UpdateAttribute Processors (so that the same Content Claim is held by 2 different FlowFiles). I let about 150,000 FlowFiles queue up (with backpressure turned off). I then started one of the UpdateAttribute processors, which drained its queue. I could then look at my content repo for any files not archived:

      content_repository $ find . -type f | grep -v archive | wc -l
           192
      

      After a few minutes, the FlowFile repo is checkpointed, which results in things getting cleaned up if they can be. The above command shows the same result (expected, since the FlowFiles are still held). I then emptied the queue. After the FlowFile repo checkpoints again, I should see nothing in the content repo outside of archive, but I see:

      content_repository $ find . -type f | grep -v archive | wc -l
           167
      

      I see the same thing happening if I turn on expiration to remove the FlowFiles instead of clicking Empty Queue, or if a processor runs and completes the processing of the data.
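
For repeated checks between checkpoints, the `find` pipeline used above can be wrapped in a small helper. This is only a sketch: the function name `count_unarchived` is hypothetical and not part of NiFi, and it assumes the content repository directory is passed as the first argument (defaulting to the current directory, as in the commands above).

```shell
#!/bin/sh
# Hypothetical helper (not part of NiFi): count regular files under a
# content repository directory, excluding any path containing "archive".
# Mirrors: find . -type f | grep -v archive | wc -l
count_unarchived() {
    repo_dir="${1:-.}"
    # Run in a subshell so the caller's working directory is unchanged,
    # and so the filter only sees paths relative to the repository root.
    (
        cd "$repo_dir" || exit 1
        # tr strips the padding that some wc implementations emit.
        find . -type f | grep -v archive | wc -l | tr -d ' '
    )
}
```

Calling `count_unarchived content_repository` before and after emptying the queue gives the two numbers to compare. Once the FlowFile repo has checkpointed, the second number would be expected to drop to 0.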

            People

              Assignee: markap14 Mark Payne
              Reporter: markap14 Mark Payne