Details
-
Bug
-
Status: Resolved
-
Minor
-
Resolution: Fixed
-
Impala 4.0.0
-
Description
single_node_perf_run.py has a required argument "scale". If scale > 1, the script runs fine. But if scale = 1 and load is true, the data loading script fails because the dataset is missing. This is because the preload script does not append the scale number to the name of the dataset directory it creates.
For example, tpch at scale 1 creates the dataset dir "testdata/impala-data/tpch".
generate-schema-statements.py, on the other hand, creates template SQL that refers to "testdata/impala-data/tpch1".
Consider creating a symlink in the preload script when the scale factor is 1.
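
A minimal sketch of the symlink idea, not the fix that was actually committed. The helper name link_scale_one_dataset and its parameters are hypothetical, and the real preload script may be structured differently. It assumes the unsuffixed directory (e.g. "tpch") already exists under the data root and adds a "tpch1" link next to it:

```python
import os


def link_scale_one_dataset(data_root, workload, scale):
    """Hypothetical helper: make '<workload>1' resolve to '<workload>' when scale is 1.

    Returns the path of the suffixed name, or None when no link is needed.
    """
    if scale != 1:
        # Per the report, runs with scale > 1 already work, so do nothing.
        return None

    unsuffixed = os.path.join(data_root, workload)
    suffixed = os.path.join(data_root, "{0}{1}".format(workload, scale))

    if not os.path.isdir(unsuffixed):
        raise IOError("dataset directory not found: " + unsuffixed)

    # lexists() is also true for a dangling link, so we never try to
    # create a link over an existing name.
    if not os.path.lexists(suffixed):
        # A relative target keeps the link valid if data_root is moved.
        os.symlink(workload, suffixed)
    return suffixed
```

Called as link_scale_one_dataset("testdata/impala-data", "tpch", 1), this would leave "testdata/impala-data/tpch1" pointing at "testdata/impala-data/tpch", so the generated SQL and the preloaded data agree on the path. Repeated calls are harmless because an existing link is left in place.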