[LUCENE-5611] Simplify the default indexing chain - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 4.9, 6.0
Component/s: core/index
Labels:
None

Lucene Fields:

New

Description

I think Lucene's current indexing chain has too many classes /
hierarchy / abstractions, making it look much more complex than it
really should be, and discouraging users from experimenting/innovating
with their own indexing chains.

Also, if it were easier to understand/approach, then new developers
would more likely try to improve it ... it really should be simpler.

So I'm exploring a pared back indexing chain, and have a starting patch
that I think is looking ok: it seems more approachable than the
current indexing chain, or at least has fewer strange classes.

I also thought this could give some speedup for tiny documents (a more
common use of Lucene lately), and it looks like, with the evil
optimizations, this is a ~25% speedup for Geonames docs. Even without
those evil optos it's a bit faster.

This is very much a work in progress / nocommits, and there are some
behavior changes e.g. the new chain requires all fields to have the
same TV options (rather than auto-upgrading all fields by the same
name that the current chain does)...

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

LUCENE-5611.patch
24/Apr/14 15:40
189 kB
Michael McCandless
LUCENE-5611.patch
16/Apr/14 21:35
176 kB
Michael McCandless

Activity

People

Assignee:: Michael McCandless

Reporter:: Michael McCandless

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Dates

Created:: 16/Apr/14 21:27

Updated:: 28/Aug/22 14:05

Resolved:: 29/Apr/14 21:47