Details
-
Bug
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
Impala 2.11.0
-
ghx-label-2
Description
The stress test (concurrent_select.py) cannot find queries for the tpch_nested workload. The problem seems to be that the file names within the directory do not contain the full workload name, but just tpch. The offending code is here:
1029 def load_tpc_queries(workload, load_in_kudu=False): 1030 """Returns a list of TPC queries. 'workload' should either be 'tpch' or 'tpcds'. 1031 If 'load_in_kudu' is True, it loads only queries specified for the Kudu storage 1032 engine. 1033 """ 1034 LOG.info("Loading %s queries", workload) 1035 queries = list() 1036 query_dir = os.path.join( 1037 os.path.dirname(__file__), "..", "..", "testdata", "workloads", workload, "queries") 1038 engine = 'kudu-' if load_in_kudu else '' 1039 file_name_pattern = re.compile(r"%s-%s(q\d+).test$" % (workload, engine)) 1040 for query_file in os.listdir(query_dir): ... 1888 if args.tpch_nested_db: 1889 tpch_nested_queries = load_tpc_queries("tpch_nested")
But here are the queries:
$ ls testdata/workloads/tpch_nested/queries tpch-q10.test tpch-q15.test tpch-q1.test tpch-q3.test tpch-q8.test tpch-q11.test tpch-q16.test tpch-q20.test tpch-q4.test tpch-q9.test tpch-q12.test tpch-q17.test tpch-q21.test tpch-q5.test tpch-q13.test tpch-q18.test tpch-q22.test tpch-q6.test tpch-q14.test tpch-q19.test tpch-q2.test tpch-q7.test $
As you can see, this means the code isn't going to find any queries in that directory.