[DERBY-4007] Optimization of IN with nested SELECT - ASF JIRA

Details

Type: Improvement
Status: Open
Priority: Minor
Resolution: Unresolved
Affects Version/s: 10.4.2.0
Fix Version/s: None
Component/s: SQL
Labels: derby_triage10_10
Environment: Linux
Urgency: Normal
Issue & fix info: Repro attached
Bug behavior facts: Performance

Description

The problem is with the following query:

UPDATE summa_records SET base='foobar' WHERE id IN ( SELECT parentId FROM summa_relations WHERE childId='horizon_2615441');

It takes on the order of 30 s to run, when we expect something on the order of 1-2 ms.

We have a setup with two tables:

summa_records: 1.5M rows
summa_relations: ~350000 rows

summa_records has an 'id' column that is indexed and is the primary key. The summa_relations table holds mappings between different ids.

In our case the nested SELECT produces 2 hits, say, 'foo' and 'bar', so the UPDATE on these two rows should be quite snappy. If we run the SELECT alone, it completes instantly. The same is true if we run the UPDATE with hardcoded ids in the IN clause:

UPDATE summa_records SET base='foobar' WHERE id IN ('foo', 'bar');

This executes instantly. I'll attach a query plan in a sec.
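
Since the SELECT alone and the UPDATE with hardcoded ids are both fast, one possible client-side workaround is to run them as two separate statements. The following is only a sketch of that idea, not a fix for the optimizer: the class and method names (TwoStepUpdate, buildUpdateSql, updateParents) are hypothetical, the id columns are assumed to be string-typed, and it uses parameter markers rather than literals, which this report does not confirm Derby plans the same way. It also assumes Java 8 or later.

```java
import java.sql.Connection;
import java.sql.PreparedStatement;
import java.sql.ResultSet;
import java.sql.SQLException;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

/** Hypothetical helper: splits UPDATE ... WHERE id IN (SELECT ...) into two statements. */
final class TwoStepUpdate {

    /** Builds "UPDATE summa_records SET base=? WHERE id IN (?, ?, ...)" with idCount id markers. */
    static String buildUpdateSql(int idCount) {
        if (idCount < 1) {
            throw new IllegalArgumentException("idCount must be >= 1");
        }
        String markers = String.join(", ", Collections.nCopies(idCount, "?"));
        return "UPDATE summa_records SET base=? WHERE id IN (" + markers + ")";
    }

    /** Runs the inner SELECT first, then the UPDATE with an explicit id list. Returns rows updated. */
    static int updateParents(Connection conn, String childId, String base) throws SQLException {
        // Step 1: fetch the parent ids (assumed to be strings, as in the report's examples).
        List<String> parentIds = new ArrayList<>();
        try (PreparedStatement select = conn.prepareStatement(
                "SELECT parentId FROM summa_relations WHERE childId=?")) {
            select.setString(1, childId);
            try (ResultSet rs = select.executeQuery()) {
                while (rs.next()) {
                    parentIds.add(rs.getString(1));
                }
            }
        }
        if (parentIds.isEmpty()) {
            return 0; // nothing to update; an empty IN list would not be valid SQL
        }
        // Step 2: update using an explicit IN list instead of the nested SELECT.
        try (PreparedStatement update = conn.prepareStatement(buildUpdateSql(parentIds.size()))) {
            update.setString(1, base);
            for (int i = 0; i < parentIds.size(); i++) {
                update.setString(i + 2, parentIds.get(i)); // marker 1 is 'base'
            }
            return update.executeUpdate();
        }
    }
}
```

A call such as TwoStepUpdate.updateParents(conn, "horizon_2615441", "foobar") would stand in for the original statement. Unlike the single UPDATE, the two steps are not atomic unless the caller runs them inside one transaction.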

Attachments

- derby.log (4 kB), 08/Jan/09 15:46, Mikkel Kamstrup Erlandsen
- dblook.log (1.0 kB), 13/Jan/09 14:28, Mikkel Kamstrup Erlandsen
- derby_p_index.log (5 kB), 13/Jan/09 15:02, Mikkel Kamstrup Erlandsen
- dblook_p_index.log (1.0 kB), 13/Jan/09 15:04, Mikkel Kamstrup Erlandsen
- CreateDatabase4007.java (1 kB), 03/Jul/09 13:14, Knut Anders Hatlen

People

Assignee: Unassigned
Reporter: Mikkel Kamstrup Erlandsen
Votes: 1
Watchers: 1

Dates

Created: 08/Jan/09 15:44
Updated: 30/Sep/12 01:53