Uploaded image for project: 'Jackrabbit Content Repository'
  1. Jackrabbit Content Repository
  2. JCR-2311

Improve indexing performance by reusing already create Lucene Documents within a single indexing transaction

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Minor
    • Resolution: Later
    • None
    • None
    • indexing
    • None

    Description

      When using indexing aggregates, it can become very expensive when for example binary data like pdfs are involved. Within a single MultiIndex transaction, we should be save to reuse the created Lucene Document's. This is useful for aggregates, as this might currently involve reindexing the same node multiple times now in a single transaction.

      In the mail discussion below I describe to use WeakReferences, though I think SoftReferences are in this case better suited (we do not want the created lucene docs to be gc-ed, unless memory gets low...)

      Also see http://www.nabble.com/Improve-indexing-performance-wrt-Aggregates-td25471828.html

      Attachments

        Activity

          People

            aschrijvers Ard Schrijvers
            aschrijvers Ard Schrijvers
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: