Uploaded image for project: 'Kudu'
  1. Kudu
  2. KUDU-1306 timeseries features and improvements
  3. KUDU-1312

expose API for partition-aware scan descriptors

    XMLWordPrintableJSON

    Details

    • Type: Sub-task
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.9.0
    • Component/s: client
    • Labels:
      None
    • Target Version/s:

      Description

      Most integrations working with Kudu will eventually need to access partition information when scanning so that data locality can be preserved.

      We should provide an API which takes scan parameters (start/stop primary key and predicates) and returns a set of scan descriptors, each of which is associated with a data location. Partition pruning should be built in to this API so that each integration does not have to reinvent that particular wheel.

      The API should also allow a descriptor to be split into further pieces, so that higher-level integrations can increase parallelism without client-side filtering.

      Finally, the descriptors should be serializable so that they can be passed between the C++ and Java clients (mostly to allow Impala to construct the query plan in Java and execute it in C++).

        Attachments

          Activity

            People

            • Assignee:
              danburkert Dan Burkert
              Reporter:
              danburkert Dan Burkert
            • Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: