[SOLR-5986] Don't allow runaway queries from harming Solr cluster health or search performance - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Closed
Priority: Critical
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 5.0
Component/s: search
Labels:
None

Description

The intent of this ticket is to have all distributed search requests stop wasting CPU cycles on requests that have already timed out or are so complicated that they won't be able to execute. We have come across a case where a nasty wildcard query within a proximity clause was causing the cluster to enumerate terms for hours even though the query timeout was set to minutes. This caused a noticeable slowdown within the system which made us restart the replicas that happened to service that one request, the worst case scenario are users with a relatively low zk timeout value will have nodes start dropping from the cluster due to long GC pauses.

amccurry Built a mechanism into Apache Blur to help with the issue in ~~BLUR-142~~ (see commit comment for code, though look at the latest code on the trunk for newer bug fixes).

Solr should be able to either prevent these problematic queries from running by some heuristic (possibly estimated size of heap usage) or be able to execute a thread interrupt on all query threads once the time threshold is met. This issue mirrors what others have discussed on the mailing list: http://mail-archives.apache.org/mod_mbox/lucene-solr-user/200903.mbox/%3C856ac15f0903272054q2dbdbd19kea3c5ba9e105b9d8@mail.gmail.com%3E

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

SOLR-5986.patch
14/Aug/14 23:47
22 kB
Anshum Gupta
SOLR-5986.patch
05/Sep/14 20:30
24 kB
Anshum Gupta
SOLR-5986.patch
09/Sep/14 23:08
23 kB
Anshum Gupta
SOLR-5986.patch
10/Sep/14 07:09
20 kB
Anshum Gupta
SOLR-5986.patch
15/Sep/14 19:23
39 kB
Anshum Gupta
SOLR-5986.patch
15/Sep/14 19:26
40 kB
Anshum Gupta
SOLR-5986.patch
15/Sep/14 20:06
40 kB
Anshum Gupta
SOLR-5986.patch
15/Sep/14 21:11
36 kB
Anshum Gupta
SOLR-5986.patch
16/Sep/14 21:41
47 kB
Anshum Gupta
SOLR-5986.patch
17/Sep/14 05:58
47 kB
Anshum Gupta
SOLR-5986.patch
21/Sep/14 17:46
59 kB
Steven Rowe
SOLR-5986.patch
25/Sep/14 19:18
58 kB
Anshum Gupta
SOLR-5986-fixtests.patch
07/Oct/14 06:17
3 kB
Anshum Gupta
SOLR-5986-fixtests.patch
07/Oct/14 17:20
2 kB
Anshum Gupta
SOLR-5986-fixtests.patch
08/Oct/14 06:41
13 kB
Anshum Gupta

Issue Links

is related to

SOLR-6831 Make facet pivots respect timeout from SolrQueryTimeoutImpl

Open

SOLR-6623 NPE in StoredFieldsShardResponseProcessor possible when using TIME_ALLOWED param

Resolved

SOLR-6930 Provide "Circuit Breakers" For Expensive Solr Queries

Resolved

relates to

LUCENE-9036 ExitableDirectoryReader to interrupt DocValues as well

Closed

SOLR-6564 Fix failing ExitableDirectoryReader tests for Solr

Closed

Activity

People

Assignee:: Anshum Gupta

Reporter:: Steve Davids

Votes:: 3 Vote for this issue

Watchers:: 27 Start watching this issue

Dates

Created:: 16/Apr/14 04:19

Updated:: 13/Aug/20 13:26

Resolved:: 12/Oct/14 04:32