[LUCENE-2312] Search on IndexWriter's RAM Buffer - ASF JIRA

Details

Type: New Feature
Status: Open
Priority: Major
Resolution: Unresolved
Affects Version/s: 4.0-ALPHA
Fix Version/s: None
Component/s: core/search
Labels: None
Lucene Fields: New

Description

To offer users near-realtime (NRT) search without incurring
an indexing performance penalty, we can implement search on
IndexWriter's RAM buffer. This is the buffer that is filled in
RAM as documents are indexed. Currently the RAM buffer is
flushed to the underlying directory (usually disk) before being
made searchable.

Today's Lucene-based NRT systems must incur the cost of merging
segments, which can slow indexing.

Michael Busch has good suggestions regarding how to handle deletes using max doc ids.
https://issues.apache.org/jira/browse/LUCENE-2293?focusedCommentId=12841923&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#action_12841923

The area that isn't fully fleshed out is the terms dictionary,
which needs to be sorted prior to queries executing. Currently
IW implements a specialized hash table. Michael B has a
suggestion here:
https://issues.apache.org/jira/browse/LUCENE-2293?focusedCommentId=12841915&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#action_12841915
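
To make the terms dictionary problem concrete, here is a minimal sketch of one possible approach: keep the unsorted hash for fast indexing and hand each searcher a sorted copy of its keys, leaving the hash untouched. This is not IndexWriter's actual terms hash and not necessarily what Michael B proposes in the linked comment. The class TermHashSketch and its methods are hypothetical, and plain Java Strings stand in for Lucene's byte-based terms.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical illustration only; not part of Lucene. */
class TermHashSketch {
    // term -> term id, assigned in arrival order; the hash is unsorted
    private final Map<String, Integer> termIds = new HashMap<>();

    /** Indexing path: add the term if unseen and return its id. */
    public synchronized int add(String term) {
        Integer id = termIds.get(term);
        if (id == null) {
            id = termIds.size();
            termIds.put(term, id);
        }
        return id;
    }

    /**
     * Search path: return a sorted copy of the current terms.
     * The hash itself is not reordered, so indexing can continue
     * and the returned list never sees later additions.
     */
    public synchronized List<String> sortedSnapshot() {
        List<String> terms = new ArrayList<>(termIds.keySet());
        Collections.sort(terms);
        return terms;
    }

    /** Exact-term lookup in a sorted snapshot; returns its index or -1. */
    public static int seek(List<String> sorted, String term) {
        int idx = Collections.binarySearch(sorted, term);
        return idx >= 0 ? idx : -1;
    }
}
```

The trade-off in this sketch is that every snapshot costs a full copy and an O(n log n) sort, which is the cost a real implementation would want to avoid or amortize.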

Attachments

- LUCENE-2312.patch, 30/Aug/11 16:46, 149 kB, Jason Rutherglen
- LUCENE-2312.patch, 24/Aug/11 22:52, 102 kB, Jason Rutherglen
- LUCENE-2312.patch, 13/Oct/10 17:31, 71 kB, Jason Rutherglen
- LUCENE-2312-FC.patch, 20/Dec/10 22:22, 4 kB, Jason Rutherglen

Issue Links

incorporates

- LUCENE-2575 Concurrent byte and int block implementations (Open)
- LUCENE-3199 Add non-desctructive sort to BytesRefHash (Open)
- LUCENE-3399 Enable replace-able field caches (Open)

is blocked by

- LUCENE-2324 Per thread DocumentsWriters that write their own private segments (Resolved)

is related to

- CASSANDRA-2915 Lucene based Secondary Indexes (Resolved)

relates to

- LUCENE-2346 Explore other in-memory postinglist formats for realtime search (Open)

(1 relates to)

People

Assignee: Michael Busch

Reporter: Jason Rutherglen

Votes: 0

Watchers: 15

Dates

Created: 12/Mar/10 23:09

Updated: 28/Aug/22 12:21