SOLR-3109

group=true requests result in numerous redundant shard requests

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: 3.5, 4.0-ALPHA
    • Fix Version/s: 3.6, 4.0-ALPHA
    • Component/s: search
    • Labels:
    • Environment:

      64-bit Linux, sharded environment

      Description

      During the second phase of a group query, the collator sends a query to each of the shards. The purpose of this query is for shards to respond with the doc ids that match the set of group ids returned from the first phase. The problem is that it sends this second query to each shard multiple times. Specifically, in an environment with n shards, each shard will be hit with an identical query n times during the second phase of query processing, resulting in O(n²) performance where n is the number of shards.

      I have traced this bug down to a single line in TopGroupsShardRequestFactory.java, and I am attaching a patch.
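The O(n²) fan-out described above can be illustrated with a small, self-contained sketch. The class and field names below are simplified stand-ins for Solr's ShardRequest dispatch, not the actual types: a per-shard request whose target-shard list is left null gets broadcast to every shard, so n requests become n × n dispatches.

```java
// Hypothetical model of the dispatch logic (illustrative, not Solr source).
class FanoutSketch {
    static class ShardRequest {
        String[] shards; // null means "all shards" to the dispatcher
    }

    // Build one second-phase request per shard; restrictToShard mimics the fix.
    static int countDispatches(String[] allShards, boolean restrictToShard) {
        int dispatched = 0;
        for (String shard : allShards) {
            ShardRequest sreq = new ShardRequest();
            if (restrictToShard) {
                sreq.shards = new String[] { shard }; // target only this shard
            }
            // Dispatcher fallback: a null target list means every shard.
            String[] targets = (sreq.shards == null) ? allShards : sreq.shards;
            dispatched += targets.length;
        }
        return dispatched;
    }

    public static void main(String[] args) {
        String[] shards = { "shard1", "shard2", "shard3", "shard4" };
        System.out.println(countDispatches(shards, false)); // buggy: 16 = n^2
        System.out.println(countDispatches(shards, true));  // fixed: 4 = n
    }
}
```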

      1. SOLR-3109-lucene_solr_3_5.patch
        10 kB
        Russell Black
      2. SOLR-3109-Backport-of-grouping-performace-fix-to-3.x.patch
        12 kB
        Greg Bowyer
      3. SOLR-3109.patch
        1 kB
        Russell Black
      4. SOLR-3109.patch
        8 kB
        Martijn van Groningen
      5. SOLR-3109.patch
        11 kB
        Russell Black

        Activity

        Russell Black added a comment -

        The patch changes this line of code in TopGroupsShardRequestFactory.java:

        sreq.actualShards = new String[] {shard};
        

        becomes

        sreq.shards = new String[] {shard};
        

        To see why this was a problem, look at SearchHandler.java line 249:

        sreq.actualShards = sreq.shards; // sets actualShards to null
        if (sreq.actualShards == ShardRequest.ALL_SHARDS) { // ALL_SHARDS is null
          sreq.actualShards = rb.shards; // every shard!
        }
        
        

        This sets actualShards to null, which means send the request to every shard.

        Martijn van Groningen added a comment -

        That doesn't look good... Thanks for bringing this up!

        Martijn van Groningen added a comment - - edited

        I noticed that the distributed test failed with this patch. After some digging I found out that the TopGroupsShardResponseProcessor can't really deal with multiple ShardRequests... I've updated the patch so that only one ShardRequest is created by the TopGroupsShardRequestFactory. Test passes now and I don't see any redundant real http requests being generated.

        Russell can you confirm this as well?

        Russell Black added a comment - - edited

        Martijn, I also noticed that TopGroupsShardResponseProcessor can't deal with multiple ShardRequests (although it looks like it wouldn't be too hard to add this ability). At any rate, your approach of returning a single ShardRequest containing all relevant shards sounds like the right one. I went one step further and refactored TopGroupsShardRequestFactory.java because there was significant code duplication in the class's two primary methods.

        In my testing I also discovered a closely related problem. The bug is in the data structure used to map search groups to the shards which contain them. ResponseBuilder.searchGroupToShard assumes that a given search group only resides on one shard. I could not find this assumption documented anywhere, nor can I find a reason such a restriction need be imposed. This structure is populated by SearchGroupShardResponseProcessor. There is a race condition there, wherein the last shard to report a search group will be assumed to be the only shard containing the search group. This data structure is used in TopGroupsShardRequestFactory.createRequestForSpecificShards() to know which shards to query. This means you can get a different set of shards to query depending on shard query order.

        I have changed the structure to allow a search group to be present in multiple shards.

        Patch to follow.
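The data-structure change described above can be sketched with hypothetical simplified types (illustrative only, not Solr's actual classes): the original group-to-shard map silently drops earlier shards when a later response reports the same group (last writer wins), while a group-to-set-of-shards map retains every shard that reported the group.

```java
import java.util.*;

// Hypothetical sketch of the mapping change; names are illustrative.
class GroupToShardSketch {
    // Old shape: one shard per group; later responses overwrite earlier ones.
    static Map<String, String> lastWriterWins(String[][] responses) {
        Map<String, String> m = new HashMap<>();
        for (String[] r : responses) m.put(r[1], r[0]); // r = {shard, group}
        return m;
    }

    // New shape: every shard that reported the group is retained.
    static Map<String, Set<String>> allShards(String[][] responses) {
        Map<String, Set<String>> m = new HashMap<>();
        for (String[] r : responses)
            m.computeIfAbsent(r[1], k -> new TreeSet<>()).add(r[0]);
        return m;
    }

    public static void main(String[] args) {
        // Both shards contain group "g1"; shard2's response arrives last.
        String[][] responses = { { "shard1", "g1" }, { "shard2", "g1" } };
        System.out.println(lastWriterWins(responses).get("g1")); // shard2
        System.out.println(allShards(responses).get("g1"));      // [shard1, shard2]
    }
}
```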

        Martijn van Groningen added a comment - - edited

        Thanks for the refactoring!

        The bug is in the data structure used to map search groups to the shards which contain them. ResponseBuilder.searchGroupToShard assumes that a given search group only resides on one shard. I could not find this assumption documented anywhere, nor can I find a reason such a restriction need be imposed.

        There is no such restriction. A search group can reside on more than one shard. I wonder why this issue didn't result in test failures / bugs from the beginning. I guess that because of the redundant requests all shards were queried, and that way the end result was still correct. At least the latest patch I added should have resulted in a test failure, but it didn't. Can you share how you did this testing? This can then be added to the TestDistributedGrouping test class.

        Russell Black added a comment - - edited

        The current TestDistributedGrouping test case is constructed in such a way that each record has a unique value for its search group field (i1), so that there is never more than one record in any given search group. This style of indexing conforms to the restriction discussed earlier. This is likely the reason there were no test failures.

        Martijn van Groningen added a comment -

        Yes, that might be the reason. The TestDistributedGrouping needs to be changed, so that a search group contains multiple records.

        Russell Black added a comment -

        In TestDistributedGrouping you wrote the following comment:

         // In order to validate this we need to make sure that during indexing that all documents of one group only occur on the same shard
        

        I wanted to understand the reason for that comment before making any changes to the test case. (Assuming you wanted me to update the test case – if not, I'll leave the test case in your hands)

        Martijn van Groningen added a comment -

        No worries, I didn't mean to push this work onto anyone. Just wanted to say that the test needs to be updated.

        I put that comment there because in the following three lines the group.ngroups and group.truncate features are tested. These features only work properly if documents belonging to a group reside in the same shard. If documents belonging to a group occur in more than one shard, the results are very likely incorrect.

        Tomorrow I will update the test case and get this patch committed. If you want to update the test case and have time for that, that would be great!
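The co-location requirement mentioned above can be sketched as a hypothetical routing rule (illustrative only, not Solr's actual indexing code): if each document is sent to a shard chosen by hashing its group field, all documents of one group are guaranteed to land on the same shard, which is the precondition for group.ngroups and group.truncate to be accurate.

```java
// Hedged sketch of hash-based co-location routing; names are illustrative.
class GroupRoutingSketch {
    static int shardFor(String groupValue, int numShards) {
        // Math.floorMod keeps the index non-negative for negative hash codes.
        return Math.floorMod(groupValue.hashCode(), numShards);
    }

    public static void main(String[] args) {
        int n = 4;
        // Two documents sharing a group value always map to the same shard.
        System.out.println(shardFor("groupA", n) == shardFor("groupA", n)); // true
    }
}
```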

        Russell Black added a comment -

        I'll let you do the test case, as I don't have a lot of time to spend on this. If there is a possibility of another 3.x release, I would like to backport the patch to the 3x branch as well. Let me know, and I can create the 3x backport once you have updated the test case and have your final 4.0 patch. I have already created a 3_5 backport that we will be using internally until the next release.

        Martijn van Groningen added a comment -

        You can share your 3.5 patch as you have it now. I think that applying the test changes to 3x branch isn't much effort.
        As far as I know there will be a 3.6 release, so this bug fix will also be included in that release.

        Martijn van Groningen added a comment -

        Russell, when attaching a patch can you click on the option box with the label:
        Grant license to ASF for inclusion in ASF works

        Assuming that you want to include your bug fixes in Solr.

        Cody Young added a comment -

        The bug is in the data structure used to map search groups to the shards which contain them. ResponseBuilder.searchGroupToShard assumes that a given search group only resides on one shard. I could not find this assumption documented anywhere, nor can I find a reason such a restriction need be imposed.

        Perhaps this could be left in as an advanced option. It would be a performance boost for anyone who can guarantee that a group will reside wholly on a single shard.

        group.distributeGroupCollation=true|false (defaults to true)

        Russell Black added a comment - - edited

        Perhaps this could be left in as an advanced option. It would be a performance boost for anyone who can guarantee that a group will reside wholly on a single shard.

        group.distributeGroupCollation=true|false defaults to true

        As the patch currently stands, someone who can guarantee that a group will reside wholly on a single shard will benefit already, because it will only send the query to the shard that contains the group of interest. There would be no need for a separate advanced option. I simply made the data structure allow for the possibility of having multiple shards per group, but there is no additional overhead for the single-shard case.

        Greg Bowyer added a comment -

        Since I need to test this to see if it is responsible for my large profiler costs spent in scoring and grouping, I also backported this.

        Patch attached that does a backport to Solr 3.5+.

        If my patch is terrifying please scream at me and replace it with a better one, but I figure it will be much like the one already commented on.

        Russell Black added a comment - - edited

        Greg, I had some trouble applying your patch to my code base, although visually it looks like the right changes. Is your patch intended for the 3_5 branch or 3x branch? I have attached my own version of the patch. It is a patch against the 3.5 branch (http://svn.apache.org/repos/asf/lucene/dev/branches/lucene_solr_3_5/).

        Greg Bowyer added a comment -

        It's for the 3.5 branch, but like I said I was jumping the gun; if your patch applies, forget mine and go with that.

        Russell Black added a comment -

        Re-uploaded the same 4.0 patch as before, this time with "Grant license to ASF" checked.

        Martijn van Groningen added a comment -

        The current TestDistributedGrouping test case is constructed in such a way that each record has a unique value for its search group field (i1), so that there is never more than one record in any given search group. This style of indexing conforms to the restriction discussed earlier. This is likely the reason there were no test failures.

        I think this issue doesn't exist in the released versions of Solr / 4.0-dev. Due to the bug, all shards were queried for each ShardRequest instance, so all the matching top search groups still arrived at the right shard. Only after applying the changes to TopGroupsShardRequestFactory could I make the distributed grouping test fail.

        Martijn van Groningen added a comment -

        Committed to branch3x and trunk.
        Thanks Russell and Greg for reporting and fixing this issue!

        Russell Black added a comment -

        Thanks for the quick turnaround on this! It was fun to contribute.


          People

          • Assignee:
            Martijn van Groningen
            Reporter:
            Russell Black
          • Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue
