[LUCENE-693] ConjunctionScorer - more tuneup - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 2.1
Fix Version/s: None
Component/s: core/search
Labels:
None
Environment:

Windows Server 2003 x64, Java 1.6, pretty large index

Lucene Fields:

New

Description

(See also: #~~LUCENE-443~~)
I did some profile testing with the new ConjuctionScorer in 2.1 and discovered a new bottleneck in ConjunctionScorer.sortScorers. The java.utils.Arrays.sort method is cloning the Scorers array on every sort, which is quite expensive on large indexes because of the size of the 'norms' array within, and isn't necessary.

Here is one possible solution:

private void sortScorers() {
// squeeze the array down for the sort
// if (length != scorers.length)

{ // Scorer[] temps = new Scorer[length]; // System.arraycopy(scorers, 0, temps, 0, length); // scorers = temps; // }

insertionSort( scorers,length );
// note that this comparator is not consistent with equals!
// Arrays.sort(scorers, new Comparator() { // sort the array
// public int compare(Object o1, Object o2)

{ // return ((Scorer)o1).doc() - ((Scorer)o2).doc(); // }

// });

first = 0;
last = length - 1;
}
private void insertionSort( Scorer[] scores, int len)
{
for (int i=0; i<len; i++) {
for (int j=i; j>0 && scores[j-1].doc() > scores[j].doc();j-- )

{ swap (scores, j, j-1); }

}
return;
}
private void swap(Object[] x, int a, int b)

{ Object t = x[a]; x[a] = x[b]; x[b] = t; }

The squeezing of the array is no longer needed.
We also initialized the Scorers array to 8 (instead of 2) to avoid having to grow the array for common queries, although this probably has less performance impact.

This change added about 3% to query throughput in my testing.

Peter

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

conjunction.patch
21/Nov/07 07:15
16 kB
Yonik Seeley
conjunction.patch
24/Oct/06 17:17
15 kB
Yonik Seeley
conjunction.patch
24/Oct/06 15:24
15 kB
Yonik Seeley
conjunction.patch
24/Oct/06 05:16
10 kB
Yonik Seeley
conjunction.patch.nosort1
26/Oct/06 02:52
9 kB
Yonik Seeley

Activity

People

Assignee:: Unassigned

Reporter:: Peter Keegan

Votes:: 1 Vote for this issue

Watchers:: 0 Start watching this issue

Dates

Created:: 23/Oct/06 22:36

Updated:: 28/Aug/22 11:31

Resolved:: 23/Nov/07 17:01