[LUCENE-1316] Avoidable synchronization bottleneck in MatchAllDocsQuery$MatchAllScorer

Details

Type: Bug
Status: Closed
Priority: Minor
Resolution: Fixed
Affects Version/s: 2.3
Fix Version/s: 2.9
Component/s: core/query/scoring
Labels: None
Environment: All
Lucene Fields: New
Description

The isDeleted() method on IndexReader has been mentioned a number of times as a potential synchronization bottleneck. However, the cause of this bottleneck sits at a higher level that has not received attention (at least in the threads I read).

In every case I saw where a stack trace was provided to show the lock/block, the MatchAllScorer.next() method appears higher in the stack. In Solr particularly, this scorer is used for "NOT" queries. We saw incredibly poor performance (by an order of magnitude) on our load tests for NOT queries, due to this bottleneck. The problem is that every single document is run through the isDeleted() method, which is synchronized. Having an optimized index exacerbates this issue, as there is only a single SegmentReader to synchronize on, causing a major pileup of threads waiting for the lock.

By simply having the MatchAllScorer check whether the reader has any deletions, much of this can be avoided, especially in a read-only production environment where slaves do all the high-load searching.

I modified line 67 in the MatchAllDocsQuery
FROM:
if (!reader.isDeleted(id)) {
TO:
if (!reader.hasDeletions() || !reader.isDeleted(id)) {

In our micro load test for NOT queries only, this was a major performance improvement. We also got the same query results. I don't believe this will improve the situation for indexes that have deletions.
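The effect of the short-circuit can be illustrated with a minimal, self-contained sketch. It does not use Lucene: ToyReader, lockedCalls, and countMatches are hypothetical names invented here, and the sketch assumes that hasDeletions() is unsynchronized while isDeleted() is synchronized. It only counts how often the synchronized method is entered, with and without deletions.

```java
import java.util.BitSet;
import java.util.concurrent.atomic.AtomicLong;

public class MatchAllSketch {

    /** Hypothetical stand-in for a segment reader; NOT Lucene's IndexReader. */
    static class ToyReader {
        private final int maxDoc;
        private final BitSet deleted;                     // null means "no deletions"
        final AtomicLong lockedCalls = new AtomicLong();  // times the lock was taken

        ToyReader(int maxDoc, BitSet deleted) {
            this.maxDoc = maxDoc;
            this.deleted = deleted;
        }

        int maxDoc() {
            return maxDoc;
        }

        // Assumption of this sketch: cheap and unsynchronized.
        boolean hasDeletions() {
            return deleted != null;
        }

        // Synchronized, like the method described in this report.
        synchronized boolean isDeleted(int doc) {
            lockedCalls.incrementAndGet();
            return deleted != null && deleted.get(doc);
        }
    }

    /** Visits every document, using the same condition as the proposed change. */
    static int countMatches(ToyReader reader) {
        int matches = 0;
        for (int id = 0; id < reader.maxDoc(); id++) {
            // isDeleted() is only reached when the reader reports deletions.
            if (!reader.hasDeletions() || !reader.isDeleted(id)) {
                matches++;
            }
        }
        return matches;
    }

    public static void main(String[] args) {
        ToyReader clean = new ToyReader(1000, null);
        System.out.println(countMatches(clean) + " matches, "
                + clean.lockedCalls.get() + " synchronized calls");

        BitSet dels = new BitSet(1000);
        dels.set(3);
        dels.set(42);
        ToyReader dirty = new ToyReader(1000, dels);
        System.out.println(countMatches(dirty) + " matches, "
                + dirty.lockedCalls.get() + " synchronized calls");
    }
}
```

In this sketch the reader without deletions never enters the synchronized method, while the reader with deletions still enters it once per document, which is consistent with the expectation above that indexes with deletions see no benefit.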

Please consider making this adjustment for a future bug fix release.

Attachments

LUCENE-1316.patch, 25/Jan/09 12:24, 17 kB, Michael McCandless
LUCENE-1316.patch, 16/Jan/09 16:53, 10 kB, Jason Rutherglen
LUCENE-1316.patch, 13/Jan/09 21:43, 7 kB, Jason Rutherglen
LUCENE_1316.patch, 27/Jun/08 19:47, 9 kB, Yonik Seeley
LUCENE_1316.patch, 27/Jun/08 17:06, 6 kB, Yonik Seeley
LUCENE_1316.patch, 26/Jun/08 17:44, 6 kB, Yonik Seeley
MatchAllDocsQuery.java, 25/Jun/08 15:58, 4 kB, Todd Feak

People

Assignee: Michael McCandless
Reporter: Todd Feak
Votes: 1
Watchers: 2

Dates

Created: 25/Jun/08 15:57
Updated: 28/Aug/22 11:50
Resolved: 25/Jan/09 14:39

Time Tracking

Estimated / Remaining / Logged: Not Specified