Kudu / KUDU-637

Multi-thread KuduScanNode

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: Public beta
    • Fix Version/s: None
    • Component/s: impala, perf
    • Labels: None

    Description

      Currently the KuduScanNode uses blocking APIs to read data from Kudu. This means that, for many queries, although we use the same number of CPU cycles as Parquet, we spend a lot more wall clock time, which makes us look unimpressive.

      We should switch to the simplest multi-threaded model that gets our performance in range of Parquet.
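
      As an illustration only, the kind of "simplest multi-threaded model" described above could look like the following sketch: a few scanner threads each make blocking reads and push row batches onto a bounded queue, while a single consumer drains it. This is not the actual KuduScanNode code (which lives in Impala's C++ backend); `blocking_scan`, `scan_all`, the tablet IDs, and the simulated latency are all hypothetical names and values chosen for the example.

```python
import queue
import threading
import time

_DONE = object()  # sentinel: one scanner thread has finished


def blocking_scan(tablet_id, num_batches=3, latency_s=0.01):
    """Hypothetical stand-in for a blocking scanner over one tablet."""
    for batch_no in range(num_batches):
        time.sleep(latency_s)  # simulates waiting on a blocking read
        yield (tablet_id, batch_no)


def scan_all(tablet_ids, num_threads=4, max_queued_batches=8):
    """Scan all tablets with a small thread pool and return every batch."""
    work = queue.Queue()
    for tablet_id in tablet_ids:
        work.put(tablet_id)

    # Bounded queue: scanner threads block when the consumer falls behind.
    batches = queue.Queue(maxsize=max_queued_batches)

    def scanner_thread():
        try:
            while True:
                try:
                    tablet_id = work.get_nowait()
                except queue.Empty:
                    return  # no tablets left to scan
                for batch in blocking_scan(tablet_id):
                    batches.put(batch)
        finally:
            batches.put(_DONE)  # always signal completion to the consumer

    threads = [threading.Thread(target=scanner_thread) for _ in range(num_threads)]
    for thread in threads:
        thread.start()

    # Single consumer: drain batches until every scanner thread has finished.
    results = []
    finished = 0
    while finished < num_threads:
        item = batches.get()
        if item is _DONE:
            finished += 1
        else:
            results.append(item)

    for thread in threads:
        thread.join()
    return results
```

      With this shape, the time spent waiting on blocking reads overlaps across threads, so wall clock time drops even though the total work is unchanged. Batches arrive in no particular order, which is acceptable only if the consumer does not depend on scan order.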

          People

            Assignee: Unassigned
            Reporter: Todd Lipcon (tlipcon)
            Votes: 0
            Watchers: 3
