Details
- Type: Bug
- Status: Closed
- Priority: Major
- Resolution: Fixed
Description
In kylin-it, we use the queries under the sql_distinct_precisely folder to test COUNT_DISTINCT(Bitmap). However, query04 uses a COUNT_DISTINCT(HLL) measure in its HAVING condition. This is unreasonable for a test of precise distinct counting: the HLL result is approximate, so the HAVING filter can drop rows from the result set. I think it may be causing some unpredictable test failures.
select test_cal_dt.cal_dt,sum(test_kylin_fact.price) as GMV
, count(1) as TRANS_CNT
, count(distinct TEST_COUNT_DISTINCT_BITMAP) as user_count
, count(distinct site_name) as site_count
from test_kylin_fact
inner JOIN edw.test_cal_dt as test_cal_dt
ON test_kylin_fact.cal_dt = test_cal_dt.cal_dt
inner JOIN test_category_groupings
on test_kylin_fact.leaf_categ_id = test_category_groupings.leaf_categ_id and
test_kylin_fact.lstg_site_id = test_category_groupings.site_id
inner JOIN edw.test_sites as test_sites
on test_kylin_fact.lstg_site_id = test_sites.site_id
inner JOIN edw.test_seller_type_dim as test_seller_type_dim
on test_kylin_fact.slr_segment_cd = test_seller_type_dim.seller_type_cd
where test_kylin_fact.lstg_format_name='FP-GTC'
and test_cal_dt.cal_dt between DATE '2013-05-01' and DATE '2013-08-01'
group by test_cal_dt.cal_dt
having count(distinct seller_id) > 2
On our Jenkins server, we sometimes get a build failure, but when I run the build again without modifying any code, the CI tests pass.
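One possible way to keep the approximate measure out of the filter is sketched below. It assumes the HAVING clause only needs to exercise a precise count, and it swaps seller_id for the bitmap-backed column already used in the select list. This is an illustration, not necessarily the change that was committed, and it does alter which groups the query returns.

```sql
-- Hypothetical variant of query04: only the HAVING clause differs.
-- Filtering on the bitmap-backed column is an assumption made for this sketch.
select test_cal_dt.cal_dt, sum(test_kylin_fact.price) as GMV
 , count(1) as TRANS_CNT
 , count(distinct TEST_COUNT_DISTINCT_BITMAP) as user_count
 , count(distinct site_name) as site_count
from test_kylin_fact
inner JOIN edw.test_cal_dt as test_cal_dt
 ON test_kylin_fact.cal_dt = test_cal_dt.cal_dt
inner JOIN test_category_groupings
 on test_kylin_fact.leaf_categ_id = test_category_groupings.leaf_categ_id
 and test_kylin_fact.lstg_site_id = test_category_groupings.site_id
inner JOIN edw.test_sites as test_sites
 on test_kylin_fact.lstg_site_id = test_sites.site_id
inner JOIN edw.test_seller_type_dim as test_seller_type_dim
 on test_kylin_fact.slr_segment_cd = test_seller_type_dim.seller_type_cd
where test_kylin_fact.lstg_format_name = 'FP-GTC'
 and test_cal_dt.cal_dt between DATE '2013-05-01' and DATE '2013-08-01'
group by test_cal_dt.cal_dt
-- precise (bitmap) distinct count instead of the HLL-backed seller_id
having count(distinct TEST_COUNT_DISTINCT_BITMAP) > 2
```

Another option under the same assumption would be to drop the HAVING clause from this query entirely and cover HLL-based filtering in a separate test folder.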