LUCENE-2312

Search on IndexWriter's RAM Buffer

Details

    • Type: New Feature
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 4.0-ALPHA
    • Fix Version/s: None
    • Component/s: core/search
    • Labels: None
    • Lucene Fields: New

    Description

      To offer users near-real-time (NRT) search without incurring
      an indexing performance penalty, we can implement search on
      IndexWriter's RAM buffer. This is the buffer that is filled in
      RAM as documents are indexed. Currently the RAM buffer must be
      flushed to the underlying directory (usually disk) before its
      documents become searchable.

      Today's Lucene-based NRT systems must incur the cost of merging
      segments, which can slow indexing.

      Michael Busch has good suggestions regarding how to handle deletes using max doc ids.
      https://issues.apache.org/jira/browse/LUCENE-2293?focusedCommentId=12841923&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#action_12841923
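
      To make the "max doc ids" idea concrete, here is a minimal sketch of
      one possible reading of it, not the design from the linked comment.
      It assumes an append-only buffer where a reader records maxDoc when
      it opens and ignores later documents, and where a delete-by-term
      records the doc ID limit below which it applies. All class and method
      names (MaxDocDeletesSketch, RamBuffer, Snapshot) are hypothetical and
      are not Lucene APIs. The sketch is single-threaded and stores one
      term per document to stay short.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical illustration only; none of these types exist in Lucene. */
final class MaxDocDeletesSketch {

    /** Append-only in-memory buffer; the list index is the doc ID. */
    static final class RamBuffer {
        private final List<String> docs = new ArrayList<>();
        // term -> limit: the delete applies to matching docs with id < limit
        private final Map<String, Integer> deleteLimits = new HashMap<>();

        /** Appends a one-term document and returns its doc ID. */
        int addDocument(String term) {
            docs.add(term);
            return docs.size() - 1;
        }

        /** Deletes all documents with this term that were added so far. */
        void deleteByTerm(String term) {
            deleteLimits.put(term, docs.size());
        }

        /** Opens a point-in-time view without copying the documents. */
        Snapshot openSnapshot() {
            // Only the (small) delete map is copied; docs are shared.
            return new Snapshot(docs, docs.size(), new HashMap<>(deleteLimits));
        }
    }

    /** Sees only docs with id < maxDoc and the deletes known at open time. */
    static final class Snapshot {
        private final List<String> docs;
        private final int maxDoc;
        private final Map<String, Integer> deleteLimits;

        Snapshot(List<String> docs, int maxDoc, Map<String, Integer> deleteLimits) {
            this.docs = docs;
            this.maxDoc = maxDoc;
            this.deleteLimits = deleteLimits;
        }

        /** Returns IDs of live documents matching the term, in ID order. */
        List<Integer> search(String term) {
            Integer limit = deleteLimits.get(term);
            List<Integer> hits = new ArrayList<>();
            for (int id = 0; id < maxDoc; id++) {
                boolean deleted = limit != null && id < limit;
                if (!deleted && docs.get(id).equals(term)) {
                    hits.add(id);
                }
            }
            return hits;
        }
    }

    public static void main(String[] args) {
        RamBuffer buffer = new RamBuffer();
        buffer.addDocument("a");
        buffer.addDocument("b");
        Snapshot before = buffer.openSnapshot();
        buffer.deleteByTerm("a");
        buffer.addDocument("a");
        Snapshot after = buffer.openSnapshot();
        System.out.println("before delete: " + before.search("a"));
        System.out.println("after delete:  " + after.search("a"));
    }
}
```

      In this sketch a snapshot opened before the delete keeps seeing the
      old document, while a snapshot opened afterwards sees only the
      document that was re-added after the delete. A real implementation
      would also need safe concurrent access to the shared buffer, which
      is left out here.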

      The area that isn't fully fleshed out is the terms dictionary,
      which needs to be sorted before queries can execute against it.
      Currently IndexWriter keeps terms in a specialized hash table,
      which is unsorted. Michael Busch has a suggestion here:
      https://issues.apache.org/jira/browse/LUCENE-2293?focusedCommentId=12841915&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#action_12841915
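
      To illustrate why the sort order matters, the following sketch keeps
      terms in a plain hash map for fast indexing and builds a sorted view
      lazily, only when a query needs ordered terms (here, a prefix
      lookup). This is a deliberately simple stand-in, not the approach
      from the linked comment and not IndexWriter's actual data structure.
      The names SortedTermsSketch and TermBuffer are hypothetical, and
      terms are ordered by Java's natural String order, which is an
      assumption made for brevity.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical illustration only; none of these types exist in Lucene. */
final class SortedTermsSketch {

    static final class TermBuffer {
        // Unsorted hash table: cheap to update while indexing.
        private final Map<String, Integer> termFreqs = new HashMap<>();
        // Cached sorted view; null whenever a new unique term has arrived.
        private List<String> sortedTerms;

        /** Records one occurrence of the term. */
        void addTerm(String term) {
            if (termFreqs.merge(term, 1, Integer::sum) == 1) {
                sortedTerms = null; // first occurrence invalidates the cache
            }
        }

        /** Returns how often the term was added (0 if never). */
        int freq(String term) {
            return termFreqs.getOrDefault(term, 0);
        }

        /** Sorts on demand and caches the result until the next new term. */
        List<String> sortedTerms() {
            if (sortedTerms == null) {
                List<String> terms = new ArrayList<>(termFreqs.keySet());
                Collections.sort(terms); // natural String order (assumption)
                sortedTerms = Collections.unmodifiableList(terms);
            }
            return sortedTerms;
        }

        /** Prefix lookup: binary search for the start, then scan forward. */
        List<String> termsWithPrefix(String prefix) {
            List<String> sorted = sortedTerms();
            int start = Collections.binarySearch(sorted, prefix);
            if (start < 0) {
                start = -start - 1; // insertion point of the prefix
            }
            List<String> matches = new ArrayList<>();
            for (int i = start; i < sorted.size()
                    && sorted.get(i).startsWith(prefix); i++) {
                matches.add(sorted.get(i));
            }
            return matches;
        }
    }

    public static void main(String[] args) {
        TermBuffer buffer = new TermBuffer();
        for (String t : new String[] {"lucene", "search", "luke", "index"}) {
            buffer.addTerm(t);
        }
        System.out.println(buffer.termsWithPrefix("lu"));
    }
}
```

      The trade-off in this sketch is that every new unique term forces a
      full re-sort on the next ordered query, which is exactly the cost a
      real design for this issue would want to avoid.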

      Attachments

        1. LUCENE-2312.patch
          149 kB
          Jason Rutherglen
        2. LUCENE-2312.patch
          102 kB
          Jason Rutherglen
        3. LUCENE-2312-FC.patch
          4 kB
          Jason Rutherglen
        4. LUCENE-2312.patch
          71 kB
          Jason Rutherglen

            People

              michaelbusch Michael Busch
              jasonrutherglen Jason Rutherglen