Details
-
New Feature
-
Status: Open
-
Major
-
Resolution: Unresolved
-
4.0-ALPHA
-
None
-
None
-
New
Description
In order to offer user's near realtime search, without incurring
an indexing performance penalty, we can implement search on
IndexWriter's RAM buffer. This is the buffer that is filled in
RAM as documents are indexed. Currently the RAM buffer is
flushed to the underlying directory (usually disk) before being
made searchable.
Todays Lucene based NRT systems must incur the cost of merging
segments, which can slow indexing.
Michael Busch has good suggestions regarding how to handle deletes using max doc ids.
https://issues.apache.org/jira/browse/LUCENE-2293?focusedCommentId=12841923&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#action_12841923
The area that isn't fully fleshed out is the terms dictionary,
which needs to be sorted prior to queries executing. Currently
IW implements a specialized hash table. Michael B has a
suggestion here:
https://issues.apache.org/jira/browse/LUCENE-2293?focusedCommentId=12841915&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#action_12841915
Attachments
Attachments
Issue Links
- incorporates
-
LUCENE-2575 Concurrent byte and int block implementations
- Open
-
LUCENE-3199 Add non-desctructive sort to BytesRefHash
- Open
-
LUCENE-3399 Enable replace-able field caches
- Open
- is blocked by
-
LUCENE-2324 Per thread DocumentsWriters that write their own private segments
- Resolved
- is related to
-
CASSANDRA-2915 Lucene based Secondary Indexes
- Resolved
- relates to
-
LUCENE-2346 Explore other in-memory postinglist formats for realtime search
- Open