[PHOENIX-2797] Ideas to speed up MIN/MAX/DISTINCT for prefixes of the PK - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Minor
Resolution: Duplicate
Affects Version/s: None
Fix Version/s: None
Component/s: None
Labels:
None

Description

All of MIN, MAX, and DISTINCT always perform a full scan, even when they are on a prefix of a compound key.

For MIN and MAX one only needs to find the first and last row (resp) and we'll have our answer. This works for the full key or a prefix of the key.
This should work find with or without a WHERE clause, as long as we can identify the first and last row.

For DISTINCT we could do a skip scan to the next prefix (only helps with a true prefix of a compound key).
Say the key is (K1, K2), and say further that we're doing DISTINCT(K1). We can skip to the next value of K1 once we found a value. This should have a dramatic impact when the cardinality of K2 is high.
With a WHERE clause that might itself be causing a SKIP SCAN, this might be quite tricky. Would need to think about it.

Both of these statements hold equally when querying against an index.

Anyway... Just filing this as an idea for now.

Attachments

Issue Links

is duplicated by

PHOENIX-258 Use skip scan when SELECT DISTINCT on leading row key column(s)

Closed

Activity

People

Assignee:: Unassigned

Reporter:: Lars Hofhansl

Votes:: 0 Vote for this issue

Watchers:: 4 Start watching this issue

Dates

Created:: 25/Mar/16 06:06

Updated:: 23/May/16 16:11

Resolved:: 25/Mar/16 17:47