Plan optimized by CBO. Vertex dependency in root stage Map 1 <- Map 4 (BROADCAST_EDGE) Reducer 2 <- Map 1 (CUSTOM_SIMPLE_EDGE), Map 5 (CUSTOM_SIMPLE_EDGE) Reducer 3 <- Reducer 2 (SIMPLE_EDGE) Stage-0 Fetch Operator limit:-1 Stage-1 Reducer 3 vectorized, llap File Output Operator [FS_65] compressed:false Statistics:Num rows: 83 Data size: 16683 Basic stats: COMPLETE Column stats: COMPLETE table:{"input format:":"org.apache.hadoop.mapred.TextInputFormat","output format:":"org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat","serde:":"org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe"} Select Operator [OP_64] outputColumnNames:["_col0","_col1","_col2","_col3"] Statistics:Num rows: 83 Data size: 16683 Basic stats: COMPLETE Column stats: COMPLETE Group By Operator [OP_63] | aggregations:["sum(VALUE._col0)"] | keys:KEY._col0 (type: string), KEY._col1 (type: int) | outputColumnNames:["_col0","_col1","_col2"] | Statistics:Num rows: 83 Data size: 8964 Basic stats: COMPLETE Column stats: COMPLETE |<-Reducer 2 [SIMPLE_EDGE] vectorized, llap Reduce Output Operator [RS_22] key expressions:_col0 (type: string), _col1 (type: int) Map-reduce partition columns:_col0 (type: string), _col1 (type: int) sort order:++ Statistics:Num rows: 83 Data size: 8964 Basic stats: COMPLETE Column stats: COMPLETE value expressions:_col2 (type: double) Group By Operator [OP_62] aggregations:["sum(_col2)"] keys:_col0 (type: string), _col1 (type: int) outputColumnNames:["_col0","_col1","_col2"] Statistics:Num rows: 83 Data size: 8964 Basic stats: COMPLETE Column stats: COMPLETE Select Operator [OP_61] outputColumnNames:["_col0","_col1","_col2"] Statistics:Num rows: 167 Data size: 10020 Basic stats: COMPLETE Column stats: COMPLETE Map Join Operator [MAPJOIN_60] | condition map:[{"":"Inner Join 0 to 1"}] | keys:{"Reducer 2":"KEY.reducesinkkey0 (type: bigint), KEY.reducesinkkey1 (type: int), KEY.reducesinkkey2 (type: int)","Map 5":"KEY.reducesinkkey0 (type: bigint), KEY.reducesinkkey1 (type: int), KEY.reducesinkkey2 (type: int)"} | outputColumnNames:["_col4","_col6"] | Statistics:Num rows: 167 Data size: 10020 Basic stats: COMPLETE Column stats: COMPLETE |<-Map 5 [CUSTOM_SIMPLE_EDGE] vectorized, llap | Reduce Output Operator [RS_59] | key expressions:_col1 (type: bigint), year(_col2) (type: int), month(_col2) (type: int) | Map-reduce partition columns:_col1 (type: bigint), year(_col2) (type: int), month(_col2) (type: int) | sort order:+++ | Statistics:Num rows: 1668327990 Data size: 113446303320 Basic stats: COMPLETE Column stats: COMPLETE | value expressions:_col0 (type: float), _col2 (type: date) | Select Operator [OP_58] | outputColumnNames:["_col0","_col1","_col2"] | Statistics:Num rows: 1668327990 Data size: 113446303320 Basic stats: COMPLETE Column stats: COMPLETE | Filter Operator [FIL_57] | predicate:(id is not null and month(edate) is not null) (type: boolean) | Statistics:Num rows: 1668327990 Data size: 113446303320 Basic stats: COMPLETE Column stats: COMPLETE | TableScan [TS_6] | alias:t | Statistics:Num rows: 1668327990 Data size: 113446303320 Basic stats: COMPLETE Column stats: COMPLETE |<-Map 1 [CUSTOM_SIMPLE_EDGE] vectorized, llap Reduce Output Operator [RS_56] key expressions:_col0 (type: bigint), year(_col2) (type: int), month(_col2) (type: int) Map-reduce partition columns:_col0 (type: bigint), year(_col2) (type: int), month(_col2) (type: int) sort order:+++ Statistics:Num rows: 452623029 Data size: 28967873856 Basic stats: COMPLETE Column stats: COMPLETE Map Join Operator [MAPJOIN_55] | condition map:[{"":"Inner Join 0 to 1"}] | keys:{"Map 1":"'atype' (type: string)","Map 4":"'atype' (type: string)"} | outputColumnNames:["_col0","_col2"] | Statistics:Num rows: 452623029 Data size: 28967873856 Basic stats: COMPLETE Column stats: COMPLETE |<-Map 4 [BROADCAST_EDGE] vectorized, llap | Reduce Output Operator [RS_52] | key expressions:'atype' (type: string) | Map-reduce partition columns:'atype' (type: string) | sort order:+ | Statistics:Num rows: 3 Data size: 288 Basic stats: COMPLETE Column stats: COMPLETE | Select Operator [OP_51] | Statistics:Num rows: 3 Data size: 288 Basic stats: COMPLETE Column stats: COMPLETE | Filter Operator [FIL_50] | predicate:(account_type = 'atype') (type: boolean) | Statistics:Num rows: 3 Data size: 294 Basic stats: COMPLETE Column stats: COMPLETE | TableScan [TS_3] | alias:at | Statistics:Num rows: 13 Data size: 1274 Basic stats: COMPLETE Column stats: COMPLETE |<-Select Operator [OP_54] outputColumnNames:["_col0","_col2"] Statistics:Num rows: 150874343 Data size: 24139894880 Basic stats: COMPLETE Column stats: COMPLETE Filter Operator [FIL_53] predicate:(((id is not null and (account_type = 'atype')) and year(edate) is not null) and month(edate) is not null) (type: boolean) Statistics:Num rows: 150874343 Data size: 24441643566 Basic stats: COMPLETE Column stats: COMPLETE TableScan [TS_0] alias:a Statistics:Num rows: 603497375 Data size: 97766574750 Basic stats: COMPLETE Column stats: COMPLETE