IMPALA-1928: Impala ODBC bad performance with Kerberos mechanism

Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: Impala 2.1.1, Impala 2.3.0
    • Fix Version/s: Impala 2.6.0, Impala 2.5.2
    • Component/s: Perf Investigation
    • Environment: ODBC Impala 5.2.20 and 5.2.23

    Description

      Using the Windows Impala ODBC driver with no authentication mechanism, we can run a query that produces about 80 million rows, and it finishes in about 20 minutes.

      The same query using the Kerberos authentication mechanism (against a secured CDH 5.3.1 cluster) takes about 90 minutes.

      I was able to cut this 90-minute total by about one third by increasing "Rows fetched per block" in the Advanced Options of the ODBC driver from 1,000 rows to 100,000 rows. This is the best I could get. In short, the ODBC driver with no authentication mechanism configured appears to be faster than with the Kerberos mechanism.
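
      For anyone reproducing this without the ODBC administrator dialog, the two configurations can be sketched as DSN-less connection strings. This is a minimal sketch under assumptions: the key names (AuthMech, RowsFetchedPerBlock, KrbFQDN, KrbServiceName), the driver name, and the port are taken to match the Cloudera/Simba Impala ODBC driver and should be checked against the driver's configuration guide. The helper build_conn_str and the host name are hypothetical.

```python
# Sketch only: key names are ASSUMED to match the Cloudera/Simba Impala ODBC
# driver; verify them against the driver's configuration guide before use.

def build_conn_str(host, port=21050, kerberos=False, rows_per_block=1000,
                   krb_fqdn=None, krb_service="impala",
                   driver="Cloudera ODBC Driver for Impala"):
    """Build a DSN-less ODBC connection string (hypothetical helper)."""
    parts = {
        "Driver": "{" + driver + "}",
        "Host": host,
        "Port": str(port),
        # Assumed meaning: 0 = no authentication, 1 = Kerberos.
        "AuthMech": "1" if kerberos else "0",
        # Assumed to correspond to "Rows fetched per block" in Advanced Options.
        "RowsFetchedPerBlock": str(rows_per_block),
    }
    if kerberos:
        # Kerberos needs the service principal's host and service name.
        parts["KrbFQDN"] = krb_fqdn or host
        parts["KrbServiceName"] = krb_service
    return ";".join(f"{key}={value}" for key, value in parts.items())


if __name__ == "__main__":
    # "impalad.example.com" is a placeholder host, not from the report.
    print(build_conn_str("impalad.example.com"))
    print(build_conn_str("impalad.example.com", kerberos=True,
                         rows_per_block=100000))
```

      Either string could then be passed to an ODBC client such as pyodbc.connect(conn_str, autocommit=True) to time the same query under both settings.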

      When running the same query against both clusters (one with security enabled and one without), the performance is the same. Both complete within approximately 20 minutes.
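
      To put the reported timings on a common scale, the figures above can be turned into throughput and block-fetch counts. This is only arithmetic on the numbers in this report. It assumes that "reduced by one third" means roughly 90 minutes down to 60 minutes, and that each block corresponds to one fetch request. Neither assumption is confirmed by the report.

```python
# Arithmetic on the figures quoted in this report; nothing here is measured.
ROWS = 80_000_000  # approximate result size from the description


def rows_per_second(rows, minutes):
    """Average throughput for a run that took `minutes` in total."""
    return rows / (minutes * 60)


def block_fetches(rows, rows_per_block):
    """Number of blocks needed to fetch all rows (ceiling division)."""
    return -(-rows // rows_per_block)


no_auth = rows_per_second(ROWS, 20)         # no auth mechanism, ~20 min
kerberos = rows_per_second(ROWS, 90)        # Kerberos, 1,000 rows per block
kerberos_tuned = rows_per_second(ROWS, 60)  # ASSUMED: 90 min cut by one third

if __name__ == "__main__":
    print(round(no_auth))                # 66667 rows/s
    print(round(kerberos))               # 14815 rows/s
    print(round(kerberos_tuned))         # 22222 rows/s
    print(block_fetches(ROWS, 1_000))    # 80000 blocks
    print(block_fetches(ROWS, 100_000))  # 800 blocks
```

      Under these assumptions, cutting the number of blocks by a factor of 100 recovered only part of the gap: the Kerberos run is still about three times slower than the unauthenticated one. The report does not establish where the remaining time goes.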

      What can be done in order to improve the performance of the Impala ODBC running with kerberos mechanism?

            People

              Assignee: Mostafa Mokhtar (mmokhtar)
              Reporter: Jonathan L. (akhe_impala_cf17)
              Votes: 2
              Watchers: 17