[KUDU-1312] expose API for partition-aware scan descriptors - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Sub-task
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 0.9.0
Component/s: client
Labels:
None

Target Version/s:

0.9.0

Description

Most integrations working with Kudu will eventually need to access partition information when scanning so that data locality can be preserved.

We should provide an API which takes scan parameters (start/stop primary key and predicates) and returns a set of scan descriptors, each of which is associated with a data location. Partition pruning should be built in to this API so that each integration does not have to reinvent that particular wheel.

The API should also allow a descriptor to be split into further pieces, so that higher-level integrations can increase parallelism without client-side filtering.

Finally, the descriptors should be serializable so that they can be passed between the C++ and Java clients (mostly to allow Impala to construct the query plan in Java and execute it in C++).

Attachments

Activity

People

Assignee:: Dan Burkert

Reporter:: Dan Burkert

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 25/Jan/16 19:00

Updated:: 19/Apr/16 18:06

Resolved:: 19/Apr/16 18:06