Uploaded image for project: 'Apache NiFi'
  1. Apache NiFi
  2. NIFI-756

Persistent Provenance Repository can avoid deleting events from lucene

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 0.3.0
    • Core Framework
    • None

    Description

      Currently, when events expire in the repository, they are deleted from the indices. This is very expensive. Since the index is sharded (by default at 500 MB), we can instead just ensure that searches always have a start date no earlier than the first provenance event. This way, we won't retrieve any expired records, but they can remain in the index. When all events in the index have expired (we know, based on the earliest event of the next index), we can simply close all readers/writers for the expired index and delete the entire index. This is far cheaper than continually updating the Lucene indices and would make a huge difference in performance.

      Attachments

        Issue Links

          Activity

            People

              markap14 Mark Payne
              markap14 Mark Payne
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: