Details

    • Type: New Feature
    • Status: Patch Available
    • Priority: Minor
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: None
    • Labels: None

      Description

      The contract of ClientScanner is to return rows in sort order. That limits the order in which regions can be scanned.
      I propose a simple ParallelScanner that does not have this requirement: it queries regions in parallel and returns whatever comes back first.

      This is generally useful for scans that filter a lot of data on the server, or in cases where the client can react very quickly to the returned data.

      I have a simple prototype (it doesn't do error handling right, and might be a bit heavy on the synchronization side - it uses a BlockingQueue to hand data between the client using the scanner and the threads doing the scanning; it could also potentially starve some scanners long enough for them to time out at the server).
      On the plus side, it's only about 130 lines of code.
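
      For illustration only, a minimal sketch of the idea (this is not the attached prototype; splitByRegion() is a hypothetical helper that clips one Scan per region):

      import java.util.concurrent.*;
      import org.apache.hadoop.hbase.client.*;

      // One scanner task per region pushes Results onto a shared BlockingQueue;
      // the consumer takes whatever arrives first, so rows are NOT in sort order.
      void parallelScan(final HTable table, Scan scan, int numThreads) throws Exception {
        final BlockingQueue<Result> queue = new LinkedBlockingQueue<Result>(1000);
        ExecutorService pool = Executors.newFixedThreadPool(numThreads);
        for (final Scan regionScan : splitByRegion(table, scan)) { // hypothetical helper
          pool.submit(new Callable<Void>() {
            public Void call() throws Exception {
              ResultScanner rs = table.getScanner(regionScan);
              try {
                for (Result r : rs) {
                  queue.put(r); // blocks if the consumer is slow
                }
              } finally {
                rs.close();
              }
              return null;
            }
          });
        }
        // Consumer side: take() returns results in completion order, from any region.
        Result first = queue.take();
        pool.shutdown();
      }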

      Attachments

      1. ParallelClientScanner.java
        4 kB
        Lars Hofhansl
      2. ParallelClientScanner.java
        7 kB
        Lars Hofhansl
      3. 9272-0.94.txt
        16 kB
        Lars Hofhansl
      4. 9272-0.94-v2.txt
        20 kB
        Lars Hofhansl
      5. 9272-0.94-v3.txt
        18 kB
        Lars Hofhansl
      6. 9272-0.94-v4.txt
        17 kB
        Lars Hofhansl
      7. 9272-trunk.txt
        17 kB
        Lars Hofhansl
      8. 9272-trunk-v2.txt
        18 kB
        Lars Hofhansl
      9. 9272-trunk-v3.txt
        17 kB
        Lars Hofhansl
      10. 9272-trunk-v3.txt
        17 kB
        stack
      11. 9272-trunk-v4.txt
        17 kB
        Lars Hofhansl

        Activity

        Jean-Marc Spaggiari added a comment -

        I like the idea.

        In the end, that will look like an MR job where all the regions are scanned in parallel, right?

        Lars Hofhansl added a comment -

        More like a MultiGet that farms requests out to multiple RegionServers at the same time, although I am using a different threading model (fixed number of threads and an unbounded waiting queue, rather than the reverse).
        There are a lot of options. Right now each region becomes a task and is scheduled on a threadpool. Could also group by regionserver.
        Obviously this only makes sense when the scan will touch a "reasonable" number of regions.

        Lars Hofhansl added a comment -

        UNFINISHED, UNTESTED. Just want to park it somewhere.

        Anoop Sam John added a comment -

        Same as HBASE-1935? +1 on having something like this.

        Lars Hofhansl added a comment -

        Yeah, that was the one where I convinced Stack that we do not need this.

        Lars Hofhansl added a comment -

        Compared to HBASE-1935 this is much simpler since it still uses ClientScanner under the hood (which I have since factored into its own class).
        Using ClientScanner also has the benefit that this is resistant to concurrent splits.

        Simple perf test with 30m rows, 1 col, 100 byte values. Split into 16 regions on a cluster with 16 region servers.
        The performance speedup (scan latency here) is proportional to the number of threads used when most data is filtered at the server (as expected) - of course the cluster was not otherwise busy.
        Even when all rows are returned I see the following (scanner caching: 100); the buffer size in ParallelScanner was 1000 and 10000 (made no difference perf-wise):

        • Running the ParallelScanner with 50 threads: 40.8s.
        • Running the ParallelScanner with 20 threads: 40.3s.
        • Running the ParallelScanner with 16 threads: 40.3s.
        • Running the ParallelScanner with 10 threads: 59.6s.
        • Running the ParallelScanner with 1 thread: 316s.
        • Running the standard scanner: 309s.

        So there is a 1.5% synchronization/context-switching overhead.
        It looks like this general approach is viable, and by using ClientScanner it is also ridiculously simple.

        Jean-Marc Spaggiari added a comment -

        Is there a way to make that optional, so people who don't want the 1.5% overhead can avoid it? Like when they know they will need all the data, or something like that?

        Lars Hofhansl added a comment -

        Oh yeah. You'd have to instantiate the ParallelScanner object yourself in the client anyway.

        Lars Hofhansl added a comment -

        A little cleanup. Deals with exceptions now (rethrows only the first one).

        Note: This is client side only, using only public HBase interfaces, so this does not need to be part of HBase.

        Also, re: the 1.5%: this happens only when you use ParallelScanner with a single thread; one should use ClientScanner (what HTable.getScanner returns) in that case anyway.

        Anoop Sam John added a comment -

        Encouraging numbers Lars...
        Can we have a test with many more #regions? The #threads should be well below #regions. Can the client be a little more intelligent, so as to distribute the load to all the RSs at a given point in time? Say 10 threads on the client side, 10 RSs, and 1000 regions. At a given point in time there is a chance that all 10 client threads are contacting regions on the same RS, so all the other RSs will be idle for that time. Maybe for a beginning a simple patch would be enough. These are improvements we can try later also. Good one Lars.

        Lars Hofhansl added a comment -

        I was thinking about this too (i.e. keeping all RSs busy). On the other hand I was trying to keep this simple, assuming that in most cases the region-to-server assignment would be more or less random.
        With some number of threads and a reasonably sized cluster (without which a parallel scanner does not help much anyway), one would expect a fairly even load distribution.

        So a test with more regions should see the same speedup, there is nothing inherently costly per region (the ClientScanners will need to find the region again, but it should be cached).

        There are other considerations too. For example, instead of having a task per region, one could split the requested rowkey space into N slices (using the region boundaries as a poor man's histogram, by assuming that all regions are of roughly the same size in bytes). In that case one would leave the number of threads unlimited but instead limit the number of tasks (i.e. slices).

        (also above the penalty was 2.2% rather than 1.5% - but that was just a single run anyway)

        Will do a test with more regions.
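
        For illustration, the slicing idea mentioned above might look like this (a sketch; HTable.getStartKeys() exists, the surrounding variables are assumed, and clipping against the user's start/stop rows is omitted):

        import java.util.ArrayList;
        import java.util.List;
        import org.apache.hadoop.hbase.client.Scan;

        // Use region start keys as a poor man's histogram: group consecutive
        // regions into roughly equal-sized slices, one task per slice.
        byte[][] startKeys = table.getStartKeys(); // one entry per region
        int regionsPerSlice = Math.max(1, startKeys.length / numSlices);
        List<Scan> slices = new ArrayList<Scan>();
        for (int i = 0; i < startKeys.length; i += regionsPerSlice) {
          Scan slice = new Scan(scan); // copy of the user's scan settings
          slice.setStartRow(startKeys[i]);
          int end = i + regionsPerSlice;
          if (end < startKeys.length) {
            slice.setStopRow(startKeys[end]); // first start key of the next slice
          }
          slices.add(slice);
        }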

        Lars Hofhansl added a comment - edited

        Some more data:
        30m rows, 2 CFs, 5 columns each, with 100 byte values. Split into 128 regions on 15 region servers.

        When all data is returned - this is limited by what the client can consume (via the network and by actually iterating over the result). All numbers in seconds:

        ClientScanner   1 thread   2 threads   5 threads   10 threads   50 threads
        519             529        303         192         189          187

        When all is filtered with a ValueFilter on the server (as in an analytics query):

        ClientScanner   1 thread   2 threads   5 threads   10 threads   50 threads
        53.3            53.3       28.4        11.6        6.42         1.88
        Lars Hofhansl added a comment -

        You might notice that 30m * 10 * 108 = 30gb is actually impossible to pipe over a 1GbE link in 187s.
        It turns out I had only scanned one of the column families in my test code, so the numbers are for selecting only 5 of the columns.
        Even then we're approaching what can be streamed over a 1GbE link: 80MB/s out of the max 125MB/s.

        In the case where everything is filtered we're churning through 15GB in the cache in less than 2s, in 100 byte chunks. Not too bad.
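
        (For reference, the approximate arithmetic: 30e6 rows x 10 cols x 108 bytes ≈ 32 GB, and 32 GB / 187 s ≈ 170 MB/s, well above the ~125 MB/s ceiling of 1GbE. With one CF, i.e. 5 columns, it is ≈ 16 GB / 187 s ≈ 86 MB/s, consistent with the observed 80 MB/s.)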

        Lars Hofhansl added a comment -

        So what do folks want to see from this? I have a version now that does RS round robin and also reduces synchronization a bit by passing Exceptions as Result subclasses through the Queue between the scanners and the reader thread.
        I get the fairest scheduling by submitting single-region tasks in RS round robin order to a thread pool. The only downside is that with this kind of scheduling the unbounded pools used for other HTable operations cannot be used here. I can work around that by doing the task queuing myself outside of the thread pool.

        Lastly, a theme similar to this could be used efficiently for a sorted prefetching scanner - one just spawns off N threads, this time in region order, each writing into its own queue, and then reads them in order.
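
        A hedged sketch of the "Exceptions as Result subclasses" idea (class and field names are illustrative, not the patch's actual ones):

        import java.io.IOException;
        import java.util.concurrent.BlockingQueue;
        import org.apache.hadoop.hbase.client.Result;

        // A scanner thread that fails puts this marker on the shared queue
        // instead of taking extra locks; the reader rethrows on dequeue.
        class ExceptionResult extends Result {
          final IOException cause;
          ExceptionResult(IOException cause) { this.cause = cause; }
        }

        // Reader side (sketch):
        Result r = queue.take();
        if (r instanceof ExceptionResult) {
          throw ((ExceptionResult) r).cause;
        }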

        Lars Hofhansl added a comment -

        So here's a sample patch against 0.94. It does the following:

        1. Adds an API to parallelize a single Scan.
        2. Round-robins across RegionServers.
        3. Builds its own task queue in order not to rely on a specifically configured thread pool (i.e. the HTable's pool can be used).
        4. Explores ways of automated scaling. The parallelism is controlled by a scaling factor that takes the number of region servers touched by the scan into account.
        5. Adds an alternate API where the caller can pass in a set of splits (in the form of Scans), which are then executed on the pool.
        6. Limits all thread synchronization to a BlockingQueue, which (in theory) allows the reader and the writer to lock independently.
        7. To avoid other synchronization, marker objects are passed to indicate when a thread is done or has encountered an exception.
        8. Also hooks this up with HTable (which is the only questionable - IMHO - part of this, since it changes HTableInterface and could break client applications that directly implement HTableInterface). This part is not strictly needed; ParallelClientScanner can be used on its own.
        9. Pushes a bit more common code into AbstractClientScanner.

        Please let me know what you think. If the direction is good I'll add tests and make a trunk patch.
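
        For illustration, client usage might look roughly like this (the constructor arguments and the pool variable are assumptions, not necessarily the patch's exact signatures):

        HTable table = new HTable(conf, "testtable");
        Scan scan = new Scan();
        scan.setCaching(100);
        // Rows arrive in completion order across regions, not sorted by row key.
        ResultScanner scanner = new ParallelClientScanner(table, scan, pool); // assumed signature
        try {
          for (Result r : scanner) {
            // react to rows as soon as any region returns them
          }
        } finally {
          scanner.close();
        }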

        Ted Yu added a comment - edited
        +      for (Iterator<Map.Entry<HRegionLocation, Queue<Scan>>> it = tasks.entrySet().iterator(); it.hasNext();) {
        +        Scan next = it.next().getValue().poll();
        +        if (next == null) {
        +          it.remove();
        

        If there is more than one Scan in a Queue (for some HRegionLocation), would the second and subsequent Scans be skipped due to the call to it.next() above?

        Lars Hofhansl added a comment -

        It shouldn't. it.next() advances the cursor; if the entry it returns turns out to be empty, it is removed. The next iteration then advances the cursor again. The Javadoc says: "Removes from the underlying collection the last element returned by the iterator (optional operation)."
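
        A minimal, self-contained illustration of that Iterator contract (plain java.util, nothing HBase-specific):

        import java.util.*;

        Map<String, Queue<Integer>> tasks = new HashMap<String, Queue<Integer>>();
        tasks.put("rs1", new LinkedList<Integer>(Arrays.asList(1, 2)));
        tasks.put("rs2", new LinkedList<Integer>()); // already drained
        while (!tasks.isEmpty()) {
          for (Iterator<Map.Entry<String, Queue<Integer>>> it = tasks.entrySet().iterator(); it.hasNext();) {
            Integer next = it.next().getValue().poll();
            if (next == null) {
              it.remove(); // removes only the entry just returned by next();
            }              // the remaining entries are still visited
          }
        }
        // "rs2" goes away on the first pass; 1 and 2 are both polled from "rs1".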

        Ted Yu added a comment -
        +   * close the parallel scanner, callers are strongly encouraged to call this method
        +   * doesn't wait until the threapool actually closes
        +   */
        +  @Override
        +  public void close() {
        +  }
        

        Should the ExecutorService pool be shut down in the above method?

        +      } catch (InterruptedException ix) {
        +        // ignore
        

        Please restore interrupt status.
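
        The idiom being requested, for reference (sketch):

        try {
          Result r = queue.take();
        } catch (InterruptedException ix) {
          Thread.currentThread().interrupt(); // restore the interrupt status for callers
        }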

        Patch for trunk would be nice.

        Lars Hofhansl added a comment -

        Fixes a minor issue; enables shell support in scan and count.

        Lars Hofhansl added a comment -

        Thanks Ted.
        The thread pool is always passed in, so ParallelClientScanner does not manage it in any event. Yeah, the interrupted status should be restored.
        I also need a way to kill the other threads still running when one thread throws an exception. Previously I was using an exclusive thread pool, but now it is shared.

        Will make a trunk patch too.

        Lars Hofhansl added a comment -

        Patch with fewer API changes.
        Still need to do exception handling.
        I'll probably go back to the earlier version of having a dedicated threadpool for a parallel scanner, for three reasons:

        1. It's easy to kill all outstanding tasks on a dedicated pool.
        2. This is for long-running scans anyway, so the cost of a dedicated pool is amortized nicely.
        3. The scanning does not interfere with other (more latency-sensitive, such as MultiGet) operations.

        Will also make a trunk patch soon. Promised.
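
        A sketch of reason 1, assuming nothing else shares the pool:

        import java.util.concurrent.ExecutorService;
        import java.util.concurrent.Executors;

        // Pool dedicated to this one parallel scan.
        ExecutorService pool = Executors.newFixedThreadPool(numThreads);
        // ... submit one task per region ...
        // On failure or close(): interrupts all outstanding scan tasks, and
        // only them; a shared HTable pool could not be shut down like this.
        pool.shutdownNow();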

        Lars Hofhansl added a comment -

        Parking another version. Back to a dedicated threadpool and handling thread interruption correctly.

        Lars Hofhansl added a comment -

        Here finally is a trunk version.

        Ted Yu added a comment -

        License is missing for ParallelClientScanner.java
        Please add annotation for audience.

        +  // reader interface
        +  public static interface ResultReader {
        ...
        +  // writer interface
        +  public static interface ResultWriter {
        

        Looks like the above classes can be private.

        +      } catch (InterruptedException ix) {
        +        // ignore
        

        Restore interrupt status?

        Hadoop QA added a comment -

        -1 overall. Here are the results of testing the latest attachment
        http://issues.apache.org/jira/secure/attachment/12606939/9272-trunk.txt
        against trunk revision .

        +1 @author. The patch does not contain any @author tags.

        -1 tests included. The patch doesn't appear to include any new or modified tests.
        Please justify why no new tests are needed for this patch.
        Also please list what manual steps were performed to verify this patch.

        +1 hadoop1.0. The patch compiles against the hadoop 1.0 profile.

        +1 hadoop2.0. The patch compiles against the hadoop 2.0 profile.

        -1 javadoc. The javadoc tool appears to have generated 3 warning messages.

        +1 javac. The applied patch does not increase the total number of javac compiler warnings.

        +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.

        -1 release audit. The applied patch generated 1 release audit warnings (more than the trunk's current 0 warnings).

        +1 lineLengths. The patch does not introduce lines longer than 100

        -1 site. The patch appears to cause mvn site goal to fail.

        +1 core tests. The patch passed unit tests in .

        Test results: https://builds.apache.org/job/PreCommit-HBASE-Build/7475//testReport/
        Release audit warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/7475//artifact/trunk/patchprocess/patchReleaseAuditProblems.txt
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/7475//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/7475//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-thrift.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/7475//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/7475//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/7475//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop1-compat.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/7475//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/7475//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/7475//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/7475//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
        Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/7475//console

        This message is automatically generated.

        Lars Hofhansl added a comment -

        Addressing Ted's comments.

        Hadoop QA added a comment -

        -1 overall. Here are the results of testing the latest attachment
        http://issues.apache.org/jira/secure/attachment/12607251/9272-trunk-v2.txt
        against trunk revision .

        +1 @author. The patch does not contain any @author tags.

        -1 tests included. The patch doesn't appear to include any new or modified tests.
        Please justify why no new tests are needed for this patch.
        Also please list what manual steps were performed to verify this patch.

        +1 hadoop1.0. The patch compiles against the hadoop 1.0 profile.

        +1 hadoop2.0. The patch compiles against the hadoop 2.0 profile.

        -1 javadoc. The javadoc tool appears to have generated 3 warning messages.

        +1 javac. The applied patch does not increase the total number of javac compiler warnings.

        +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.

        +1 release audit. The applied patch does not increase the total number of release audit warnings.

        +1 lineLengths. The patch does not introduce lines longer than 100

        -1 site. The patch appears to cause mvn site goal to fail.

        +1 core tests. The patch passed unit tests in .

        Test results: https://builds.apache.org/job/PreCommit-HBASE-Build/7485//testReport/
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/7485//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/7485//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-thrift.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/7485//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/7485//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/7485//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop1-compat.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/7485//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/7485//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/7485//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/7485//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
        Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/7485//console

        This message is automatically generated.

        Ted Yu added a comment -

        +1, after javadoc warnings are fixed.

        Lars Hofhansl added a comment -

        Thanks Ted. Will fix the Javadoc warnings.
        The next question is: we could check this in integrated with HTable as-is, or we could commit it as example code. I'd be fine either way.

        Lars Hofhansl added a comment -

        stack, wanna have a look?

        stack added a comment -

        Looks great. +1

        Minor stuff to address on commit.

        nit: try/finally for this bit

        + List<HRegionLocation> locs = t.getRegionsInRange(scan.getStartRow(), scan.getStopRow());
        + t.close();

        nit: presize + Map<HRegionLocation, Queue<Scan>> tasks = new HashMap<HRegionLocation, Queue<Scan>>(); – you know how many regions

        nit: + * doesn't wait until the threapool actually closes – spelling

        You mean volatile here? + private transient double parallelScaling = 0;

        Agree it should be in HTable. I like the addition to the shell.

        'Task' is too generic a name for the class, and the ReaderWriter looks like a Queue, but as long as these classes are kept private it should be fine.

        MyResult is a bit hokey for a class name. Done or DoneMarker? A bit more doc on its operation, too (it makes sense when you take a look, but not when reading the class alone).

        Lars Hofhansl added a comment -

        Cool. Thanks Stack.

        You mean volatile here? + private transient double parallelScaling = 0;

        I did mean transient (just to indicate that this is not serialized with the Scan object, since it is only in the client code).

        Will fix up the rest and post a new patch tomorrow.

        Lars Hofhansl added a comment -

        Updated trunk version:

        • renamed some inner classes
        • fixed javadocs
        • addressed Stack's comments
        Hadoop QA added a comment -

        -1 overall. Here are the results of testing the latest attachment
        http://issues.apache.org/jira/secure/attachment/12607740/9272-trunk-v3.txt
        against trunk revision .

        +1 @author. The patch does not contain any @author tags.

        -1 tests included. The patch doesn't appear to include any new or modified tests.
        Please justify why no new tests are needed for this patch.
        Also please list what manual steps were performed to verify this patch.

        +1 hadoop1.0. The patch compiles against the hadoop 1.0 profile.

        +1 hadoop2.0. The patch compiles against the hadoop 2.0 profile.

        +1 javadoc. The javadoc tool did not generate any warning messages.

        +1 javac. The applied patch does not increase the total number of javac compiler warnings.

        -1 findbugs. The patch appears to cause Findbugs (version 1.3.9) to fail.

        +1 release audit. The applied patch does not increase the total number of release audit warnings.

        +1 lineLengths. The patch does not introduce lines longer than 100

        -1 site. The patch appears to cause mvn site goal to fail.

        -1 core tests. The patch failed these unit tests:
        org.apache.hadoop.hbase.coprocessor.TestRegionObserverScannerOpenHook

        Test results: https://builds.apache.org/job/PreCommit-HBASE-Build/7512//testReport/
        Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/7512//console

        This message is automatically generated.

        stack added a comment -

        +1 on v3

        On commit, add a bit of doc around the 'scaling factor' in the javadoc - you say what it is in the shell but not in the methods, so it's confusing seeing a double returned.

        Is the test failure yours? Let me rerun the patch.

        stack added a comment -

        Retrying hadoopqa.

        If you are going to put this in 0.94, I suppose it has to go into 0.96.

        Hadoop QA added a comment -

        -1 overall. Here are the results of testing the latest attachment
        http://issues.apache.org/jira/secure/attachment/12609896/9272-trunk-v3.txt
        against trunk revision .

        +1 @author. The patch does not contain any @author tags.

        -1 tests included. The patch doesn't appear to include any new or modified tests.
        Please justify why no new tests are needed for this patch.
        Also please list what manual steps were performed to verify this patch.

        -1 patch. The patch command could not apply the patch.

        Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/7607//console

        This message is automatically generated.

        stack added a comment -

        It needs a rebase?

        Lars Hofhansl added a comment -

        Lemme look into this. I have been quite busy the past few days.

        Lars Hofhansl added a comment -

        TestRegionObserverScannerOpenHook

        I've seen this in other test runs, so this is probably not related.

        Lars Hofhansl added a comment -

        Rebased. TestRegionObserverScannerOpenHook passes locally for me.

        Should this support scan metrics? It would need to aggregate the individual scan metrics, and hence needs somewhat of a redesign (it would need to keep track of all the individual tasks via Futures and then wait for them to finish).
        Or the metrics could maybe be passed along with the marker result at the end.
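
        A hedged sketch of the Future-based aggregation described above (submitRegionTasks() and aggregate() are hypothetical; ScanMetrics is the existing client metrics class):

        import java.util.List;
        import java.util.concurrent.Future;
        import org.apache.hadoop.hbase.client.metrics.ScanMetrics;

        // Each region task reports its ScanMetrics when it finishes; the
        // scanner waits on all Futures and sums the counters.
        List<Future<ScanMetrics>> futures = submitRegionTasks(scan); // hypothetical
        ScanMetrics total = new ScanMetrics();
        for (Future<ScanMetrics> f : futures) {
          aggregate(total, f.get()); // hypothetical counter-summing helper
        }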

        Hadoop QA added a comment -

        -1 overall. Here are the results of testing the latest attachment
        http://issues.apache.org/jira/secure/attachment/12610225/9272-trunk-v4.txt
        against trunk revision .

        +1 @author. The patch does not contain any @author tags.

        -1 tests included. The patch doesn't appear to include any new or modified tests.
        Please justify why no new tests are needed for this patch.
        Also please list what manual steps were performed to verify this patch.

        +1 hadoop1.0. The patch compiles against the hadoop 1.0 profile.

        +1 hadoop2.0. The patch compiles against the hadoop 2.0 profile.

        -1 javadoc. The javadoc tool appears to have generated 1 warning messages.

        +1 javac. The applied patch does not increase the total number of javac compiler warnings.

        -1 findbugs. The patch appears to introduce 1 new Findbugs (version 1.3.9) warnings.

        +1 release audit. The applied patch does not increase the total number of release audit warnings.

        +1 lineLengths. The patch does not introduce lines longer than 100

        -1 site. The patch appears to cause mvn site goal to fail.

        +1 core tests. The patch passed unit tests in .

        Test results: https://builds.apache.org/job/PreCommit-HBASE-Build/7629//testReport/
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/7629//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/7629//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/7629//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/7629//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/7629//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/7629//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop1-compat.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/7629//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/7629//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-thrift.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/7629//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
        Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/7629//console

        This message is automatically generated.

        Lars Hofhansl added a comment -

        The metrics stuff gets obtuse pretty quickly, especially when the parallel scanner is closed and all the threads are interrupted/canceled before they can report their metrics.

        This also needs some tests. I'm moving this to 0.94.14.

        Enis Soztutar added a comment -

        The metric stuff gets obtuse pretty quickly, especially when the parallel scanner is closed and all the threads are interrupted/canceled

        Slightly related: I think pushing the metrics via the Scan is broken. The Scan should be a POJO, and the metrics should be obtained from the Scanner interface. I am doing a similar thing in HBASE-8369, where you obtain the metrics via the scanner, not the scan.

        Lars Hofhansl added a comment -

        I'm not happy with this, yet.
        Unscheduling for now.

        Jean-Marc Spaggiari added a comment -

        Did you come up with a version you are happy with?

        Hadoop QA added a comment -

        -1 overall. Here are the results of testing the latest attachment
        http://issues.apache.org/jira/secure/attachment/12610225/9272-trunk-v4.txt
        against trunk revision .
        ATTACHMENT ID: 12610225

        +1 @author. The patch does not contain any @author tags.

        -1 tests included. The patch doesn't appear to include any new or modified tests.
        Please justify why no new tests are needed for this patch.
        Also please list what manual steps were performed to verify this patch.

        -1 patch. The patch command could not apply the patch.

        Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/9827//console

        This message is automatically generated.


          People

          • Assignee:
            Lars Hofhansl
            Reporter:
            Lars Hofhansl
          • Votes:
            0 Vote for this issue
            Watchers:
            19 Start watching this issue

            Dates

            • Created:
              Updated:

              Development