[SOLR-659] Explicitly set start and rows per shard for more efficient bulk queries across distributed Solr - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Closed
Priority: Minor
Resolution: Fixed
Affects Version/s: 1.3
Fix Version/s: 1.4
Component/s: search
Labels:
None

Description

The default behavior of setting start and rows on distributed solr (~~SOLR-303~~) is to set start at 0 across all shards and set rows to start+rows across each shard. This ensures all results are returned for any arbitrary start and rows setting, but during "bulk queries" (where start is incrementally increased and rows is kept consistent) the client would need finer control of the per-shard start and rows parameter as retrieving many thousands of documents becomes intractable as start grows higher.

Attaching a patch that creates a &shards.start and &shards.rows parameter. If used, the logic that sets rows to start+rows per shard is overridden and each shard gets the exact start and rows set in shards.start and shards.rows. The client will receive up to shards.rows * nShards results and should set rows accordingly. This makes bulk queries across distributed solr possible.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

SOLR-659.patch
08/Feb/09 17:53
3 kB
Brian Whitman
shards.start_rows.patch
25/Jul/08 14:07
3 kB
Brian Whitman

Sub-Tasks

There are no Sub-Tasks for this issue.

Activity

People

Assignee:: Yonik Seeley

Reporter:: Brian Whitman

Votes:: 1 Vote for this issue

Watchers:: 1 Start watching this issue

Dates

Created:: 25/Jul/08 14:06

Updated:: 10/Nov/09 15:51

Resolved:: 07/Sep/09 18:30