Details
-
Sub-task
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
-
None
-
None
-
None
-
Sprint 50
Description
1.Set the measures to count distinct and select the precise type column.
2.Set cube —— Configuration Overwrites:
Key:kylin.dictionary.mr-hive.columns
Value: the alias of the global dictionary column is required. Multiple columns are separated by commas,Eg:KYLIN_SALES_SELLER_ID,KYLIN_SALES_BUYER_ID
Not necessary.By default, there are two reduce for each column. The more the number, the higher the concurrency.
1.Set cube —— Configuration Overwrites
Key:kylin.dictionary.mr-hive.columns.reduce.num
Value: the alias of reduce dictionary column needs to be set: the number of reduce columns, separated by commas, Eg: KYLIN_SALES_SELLER_ID:3
Reuse Mr/Hive global dictionary columns of other cubes to avoid building dictionaries repeatedly
1.cube——Configuration Overwritese
Key: kylin.dictionary.mr-hive.ref.columns
Value: globalDictColumnName:referencedCubeName_referencedColumnName, multiple columns separated by commas,Eg: KYLIN_SALES_BUYER_ID global MR/Hive Dict reuse cube kylin_sales_cube_spark_notopn's global MR/Hive column KYLIN_SALES_BUYER_ID, we can config: KYLIN_SALES_BUYER_ID:kylin_sales_cube_spark_notopn.KYLIN_SALES_BUYER_ID