[LUCENE-5316] Taxonomy tree traversing improvement - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Open
Priority: Minor
Resolution: Unresolved
Affects Version/s: None
Fix Version/s: None
Component/s: modules/facet
Labels:
None

Lucene Fields:

New

Description

The taxonomy traversing is done today utilizing the ParallelTaxonomyArrays. In particular, two taxonomy-size int arrays which hold for each ordinal it's (array #1) youngest child and (array #2) older sibling.

This is a compact way of holding the tree information in memory, but it's not perfect:

Large (8 bytes per ordinal in memory)
Exposes internal implementation
Utilizing these arrays for tree traversing is not straight forward
Lose reference locality while traversing (the array is accessed in increasing only entries, but they may be distant from one another)
In NRT, a reopen is always (not worst case) done at O(Taxonomy-size)

This issue is about making the traversing more easy, the code more readable, and open it for future improvements (i.e memory footprint and NRT cost) - without changing any of the internals.
A later issue(s?) could be opened to address the gaps once this one is done.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

LUCENE-5316.patch
17/Nov/13 20:08
44 kB
Gilad Barkai
LUCENE-5316.patch
13/Nov/13 12:01
44 kB
Gilad Barkai
LUCENE-5316.patch
05/Nov/13 20:06
44 kB
Gilad Barkai
LUCENE-5316.patch
03/Nov/13 07:14
40 kB
Gilad Barkai
LUCENE-5316.patch
30/Oct/13 14:21
37 kB
Gilad Barkai

Activity

People

Assignee:: Unassigned

Reporter:: Gilad Barkai

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Dates

Created:: 30/Oct/13 09:15

Updated:: 28/Aug/22 13:56