[LUCENE-2770] Optimize SegmentMerger to work on atomic (Segment)Readers where possible - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Closed
Priority: Minor
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 3.1, 4.0-ALPHA
Component/s: core/index
Labels:
None

Lucene Fields:

New, Patch Available

Description

This is a spin-off from ~~LUCENE-2769~~:

Currently SegmentMerger has some optimizations when it merges segments that are SegmentReaders (e.g. when doing normal indexing or optimizing). But when you do IndexWriter.addIndexes(IndexReader...) the listed IndexReaders may not really be per-segment. SegmentMerger should track down all passed in reads down to the lowest level (Segment)Reader (or other atomic readers like SlowMultiReaderWrapper) and then merge. We can then remove most MultiFields usage (except term merging itsself) and clean up the code.

This especially saves lots of memory for merging norms, as no longer the duplicate norms arrays are created when MultiReaders are used!

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

LUCENE-2770.patch
19/Nov/10 16:14
8 kB
Uwe Schindler
LUCENE-2770-3x.patch
19/Nov/10 16:37
3 kB
Uwe Schindler
LUCENE-2770.patch
19/Nov/10 16:48
9 kB
Uwe Schindler
LUCENE-2770.patch
19/Nov/10 18:16
11 kB
Uwe Schindler
LUCENE-2770-3x.patch
19/Nov/10 18:16
7 kB
Uwe Schindler
LUCENE-2770-optimizeNormsMerging.patch
19/Nov/10 23:18
2 kB
Uwe Schindler

Issue Links

relates to

LUCENE-2769 FilterIndexReader in trunk does not implement getSequentialSubReaders() correctly

Resolved

Activity

People

Assignee:: Uwe Schindler

Reporter:: Uwe Schindler

Votes:: 0 Vote for this issue

Watchers:: 0 Start watching this issue

Dates

Created:: 19/Nov/10 16:13

Updated:: 28/Aug/22 12:36

Resolved:: 19/Nov/10 18:31