Using Windows Impala ODBC Driver with no auth mechanism, we are able to run a query that produces about 80millions rows and it finishes in about 20 minutes.
The same query using auth mechanism kerberos (against secured cluster CDH 5.3.1) is taking about 90min.
I've been able to reduce this total time of 90min by 1/3 of the time by increasing the "Rows fetched per block" in the Advanced Options of the ODBC driver from 1000 rows to 100000 rows. This is the best I could get. In short it seems that an ODBC with no auth mechanism configured is faster than with kerberos mechanism.
When running the same query against both cluster (1x with security enabled and one without) the performance is the same. Both are able to complete within aprox 20min.
What can be done in order to improve the performance of the Impala ODBC running with kerberos mechanism?