[SOLR-9599] DocValues performance regression with new iterator API - ASF JIRA

Details

Type: Bug
Status: Open
Priority: Major
Resolution: Unresolved
Affects Version/s: 7.0
Fix Version/s: 7.0
Component/s: None
Labels: None

Description

I did a quick performance comparison of faceting on indexed fields (i.e. fields whose docvalues are not stored) using method=dv, before and after the new docvalues iterator went in (LUCENE-7407).

5M document index, 21 segments, single valued string fields w/ no missing values.

field cardinality	new_time / old_time
10	2.01
1000	2.02
10000	1.85
100000	1.56
1000000	1.31

So unfortunately, the new code is often about twice as slow, with the gap narrowing as field cardinality grows.

See followup messages for tests using real docvalues as well.
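
For readers unfamiliar with the two access styles being compared, here is a minimal sketch of counting facet ordinals with a random-access lookup versus a forward-only iterator. Everything in it (DocValuesAccessSketch, RandomAccessOrds, IteratorOrds, iteratorOver and the two count methods) is a hypothetical stand-in written for illustration, not Lucene or Solr code, and it assumes single-valued fields with no missing values as in the test above. It only shows the shape of the two loops and makes no claim about where the measured slowdown comes from.

```java
// Hypothetical stand-ins for illustration only; these are NOT Lucene's classes.
final class DocValuesAccessSketch {

    /** Old style: random access, one lookup per document id. */
    interface RandomAccessOrds {
        int getOrd(int docId);
    }

    /** New style: a forward-only cursor positioned on one document at a time. */
    interface IteratorOrds {
        int NO_MORE_DOCS = Integer.MAX_VALUE;
        int docID();              // current document, -1 before the first advance
        int advance(int target);  // move to the first document >= target
        int ordValue();           // ordinal of the current document
    }

    /** Assumption: dense, single-valued field, so every doc id has a value. */
    static IteratorOrds iteratorOver(int[] ords) {
        return new IteratorOrds() {
            private int doc = -1;
            public int docID() { return doc; }
            public int advance(int target) {
                doc = target < ords.length ? target : NO_MORE_DOCS;
                return doc;
            }
            public int ordValue() { return ords[doc]; }
        };
    }

    /** Counts ordinals with one random-access lookup per matching doc. */
    static int[] countRandomAccess(RandomAccessOrds dv, int[] matchingDocs, int cardinality) {
        int[] counts = new int[cardinality];
        for (int doc : matchingDocs) {
            counts[dv.getOrd(doc)]++;
        }
        return counts;
    }

    /** Counts ordinals by advancing the cursor; matchingDocs must be ascending. */
    static int[] countIterator(IteratorOrds dv, int[] matchingDocs, int cardinality) {
        int[] counts = new int[cardinality];
        for (int doc : matchingDocs) {
            if (dv.docID() < doc) {
                dv.advance(doc);
            }
            // The position check is extra per-document work the old loop did not need.
            if (dv.docID() == doc) {
                counts[dv.ordValue()]++;
            }
        }
        return counts;
    }
}
```

Both methods return the same counts for the same input; the iterator version simply does more bookkeeping per document, which is the kind of difference a benchmark like the one above would be sensitive to.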

Issue Links

is broken by: LUCENE-7407 Explore switching doc values to an iterator API (Resolved)

is related to: LUCENE-8374 Reduce reads for sparse DocValues (Resolved)

relates to: LUCENE-7462 Faster search APIs for doc values (Resolved)

Sub-Tasks

1.	Improve faceting performance with FieldCache (single-valued string counts only)	Resolved	Yonik Seeley
2.	Performance regression of numeric field uninversion time	Resolved	Yonik Seeley

People

Assignee: Unassigned

Reporter: Yonik Seeley

Votes: 4

Watchers: 12

Dates

Created: 05/Oct/16 00:21

Updated: 08/Jun/19 15:31