Details
-
Bug
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
Impala 4.4.0
-
None
-
ghx-label-2
Description
The tpc-h and tpc-ds components of the toolchain do not enable any kind of compiler optimization flags. This is irrelevant to Impala's shipped binary, but it does impact the performance of the data generators for TPC-H and TPC-DS. Turning on -O3 seems to improve the data generation time by ~25%.
##### TPC-H ######## # Unoptimized $ time ./dbgen -f -s 42 TPC-H Population Generator (Version 2.17.0) Copyright Transaction Processing Performance Council 1994 - 2010 real 4m46.269s user 4m20.982s sys 0m19.390s # -O3 $ time ./dbgen -f -s 42 TPC-H Population Generator (Version 2.17.0) Copyright Transaction Processing Performance Council 1994 - 2010 real 3m46.379s user 3m23.721s sys 0m18.436s ##### TPC-DS ####### # Unoptimized $ time ./dsdgen -force -scale 20 DBGEN2 Population Generator (Version 2.0.0) Copyright Transaction Processing Performance Council (TPC) 2001 - 2015 Warning: Selected scale factor is NOT valid for result publication real 9m41.441s user 8m3.447s sys 1m37.944s # -O3 $ time ./dsdgen -force -scale 20 DBGEN2 Population Generator (Version 2.0.0) Copyright Transaction Processing Performance Council (TPC) 2001 - 2015 Warning: Selected scale factor is NOT valid for result publication real 7m25.017s user 5m48.487s sys 1m36.265s
We should modify the toolchain to add -O3 to these builds.