Uploaded image for project: 'Apache NiFi'
  1. Apache NiFi
  2. NIFI-3423

List based processors don't support source directories with high file count.

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 1.1.1
    • None
    • Core Framework
    • None

    Description

      NiFi FlowFile attributes/metadata lives in heap. The List based processors return a complete listing from the target and then creates a FlowFile for each File in that returned listing. The FlowFiles being created are not committed to the list processor's success relationship until all have been created. So you end up running out of NiFi JVM heap memory before that can happen when the returned listing is very large.

      It would be nice if the list based processors could commit batches (10,000) of FlowFiles at a time from the returned listing instead of trying to commit them all at once to help avoid heap exhaustion.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              msclarke Matthew Clarke
              Votes:
              1 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated: