STAGE DEPENDENCIES: Stage-1 is a root stage Stage-0 depends on stages: Stage-1 STAGE PLANS: Stage: Stage-1 Tez Edges: Reducer 3 <- Map 2 (SIMPLE_EDGE), Map 9 (SIMPLE_EDGE) Reducer 4 <- Map 8 (SIMPLE_EDGE), Reducer 3 (SIMPLE_EDGE) Reducer 5 <- Map 1 (BROADCAST_EDGE), Map 10 (SIMPLE_EDGE), Map 11 (BROADCAST_EDGE), Reducer 4 (SIMPLE_EDGE) Reducer 6 <- Reducer 5 (SIMPLE_EDGE) Reducer 7 <- Reducer 6 (SIMPLE_EDGE) DagName: gopal_20140616134747_519950a2-87b3-49c2-83ff-5a4e22c68ffc:1 Vertices: Map 1 Map Operator Tree: TableScan alias: region filterExpr: (r_name = 'ASIA') (type: boolean) Statistics: Num rows: 5 Data size: 1220 Basic stats: COMPLETE Column stats: NONE Filter Operator predicate: (r_name = 'ASIA') (type: boolean) Statistics: Num rows: 2 Data size: 488 Basic stats: COMPLETE Column stats: NONE Reduce Output Operator key expressions: r_regionkey (type: int) sort order: + Map-reduce partition columns: r_regionkey (type: int) Statistics: Num rows: 2 Data size: 488 Basic stats: COMPLETE Column stats: NONE value expressions: r_name (type: string) Execution mode: vectorized Map 10 Map Operator Tree: TableScan alias: supplier filterExpr: s_nationkey is not null (type: boolean) Statistics: Num rows: 10000000 Data size: 4714940000 Basic stats: COMPLETE Column stats: NONE Filter Operator predicate: s_nationkey is not null (type: boolean) Statistics: Num rows: 5000000 Data size: 2357470000 Basic stats: COMPLETE Column stats: NONE Reduce Output Operator key expressions: s_suppkey (type: int), s_nationkey (type: int) sort order: ++ Map-reduce partition columns: s_suppkey (type: int), s_nationkey (type: int) Statistics: Num rows: 5000000 Data size: 2357470000 Basic stats: COMPLETE Column stats: NONE Execution mode: vectorized Map 11 Map Operator Tree: TableScan alias: nation filterExpr: n_regionkey is not null (type: boolean) Statistics: Num rows: 25 Data size: 6425 Basic stats: COMPLETE Column stats: NONE Filter Operator predicate: n_regionkey is not null (type: boolean) Statistics: Num rows: 13 Data size: 3341 Basic stats: COMPLETE Column stats: NONE Reduce Output Operator key expressions: n_nationkey (type: int) sort order: + Map-reduce partition columns: n_nationkey (type: int) Statistics: Num rows: 13 Data size: 3341 Basic stats: COMPLETE Column stats: NONE value expressions: n_name (type: string), n_regionkey (type: int) Execution mode: vectorized Map 2 Map Operator Tree: TableScan alias: orders filterExpr: ((o_orderdate >= '1994-01-01') and (o_orderdate < '1995-01-01')) (type: boolean) Statistics: Num rows: 1500000000 Data size: 779147483644 Basic stats: COMPLETE Column stats: NONE Filter Operator predicate: ((o_orderdate >= '1994-01-01') and (o_orderdate < '1995-01-01')) (type: boolean) Statistics: Num rows: 166666666 Data size: 86571942280 Basic stats: COMPLETE Column stats: NONE Reduce Output Operator key expressions: o_custkey (type: int) sort order: + Map-reduce partition columns: o_custkey (type: int) Statistics: Num rows: 166666666 Data size: 86571942280 Basic stats: COMPLETE Column stats: NONE value expressions: o_orderkey (type: int), o_orderdate (type: string) Execution mode: vectorized Map 8 Map Operator Tree: TableScan alias: lineitem filterExpr: (l_orderkey is not null and l_suppkey is not null) (type: boolean) Statistics: Num rows: 5999989709 Data size: 4745677733354 Basic stats: COMPLETE Column stats: PARTIAL Filter Operator predicate: (l_orderkey is not null and l_suppkey is not null) (type: boolean) Statistics: Num rows: 1499997428 Data size: 11999979424 Basic stats: COMPLETE Column stats: PARTIAL Reduce Output Operator key expressions: l_orderkey (type: int) sort order: + Map-reduce partition columns: l_orderkey (type: int) Statistics: Num rows: 1499997428 Data size: 11999979424 Basic stats: COMPLETE Column stats: PARTIAL value expressions: l_suppkey (type: int), l_extendedprice (type: double), l_discount (type: double) Execution mode: vectorized Map 9 Map Operator Tree: TableScan alias: customer filterExpr: (c_custkey is not null and c_nationkey is not null) (type: boolean) Statistics: Num rows: 150000000 Data size: 86097600000 Basic stats: COMPLETE Column stats: NONE Filter Operator predicate: (c_custkey is not null and c_nationkey is not null) (type: boolean) Statistics: Num rows: 37500000 Data size: 21524400000 Basic stats: COMPLETE Column stats: NONE Reduce Output Operator key expressions: c_custkey (type: int) sort order: + Map-reduce partition columns: c_custkey (type: int) Statistics: Num rows: 37500000 Data size: 21524400000 Basic stats: COMPLETE Column stats: NONE value expressions: c_nationkey (type: int) Execution mode: vectorized Reducer 3 Reduce Operator Tree: Join Operator condition map: Inner Join 0 to 1 condition expressions: 0 {KEY.reducesinkkey0} {VALUE._col2} 1 {VALUE._col0} {KEY.reducesinkkey0} {VALUE._col3} outputColumnNames: _col0, _col3, _col10, _col11, _col14 Statistics: Num rows: 183333344 Data size: 95229140992 Basic stats: COMPLETE Column stats: NONE Reduce Output Operator key expressions: _col10 (type: int) sort order: + Map-reduce partition columns: _col10 (type: int) Statistics: Num rows: 183333344 Data size: 95229140992 Basic stats: COMPLETE Column stats: NONE value expressions: _col0 (type: int), _col3 (type: int), _col11 (type: int), _col14 (type: string) Reducer 4 Reduce Operator Tree: Join Operator condition map: Inner Join 0 to 1 condition expressions: 0 {VALUE._col0} {VALUE._col3} {KEY.reducesinkkey0} {VALUE._col10} {VALUE._col13} 1 {KEY.reducesinkkey0} {VALUE._col1} {VALUE._col4} {VALUE._col5} outputColumnNames: _col0, _col3, _col10, _col11, _col14, _col21, _col23, _col26, _col27 Statistics: Num rows: 1649997184 Data size: 13199977472 Basic stats: COMPLETE Column stats: NONE Reduce Output Operator key expressions: _col23 (type: int), _col3 (type: int) sort order: ++ Map-reduce partition columns: _col23 (type: int), _col3 (type: int) Statistics: Num rows: 1649997184 Data size: 13199977472 Basic stats: COMPLETE Column stats: NONE value expressions: _col0 (type: int), _col10 (type: int), _col11 (type: int), _col14 (type: string), _col21 (type: int), _col26 (type: double), _col27 (type: double) Reducer 5 Reduce Operator Tree: Join Operator condition map: Inner Join 0 to 1 condition expressions: 0 {VALUE._col0} {KEY.reducesinkkey1} {VALUE._col9} {VALUE._col10} {VALUE._col13} {VALUE._col20} {KEY.reducesinkkey0} {VALUE._col24} {VALUE._col25} 1 {KEY.reducesinkkey0} {KEY.reducesinkkey1} outputColumnNames: _col0, _col3, _col10, _col11, _col14, _col21, _col23, _col26, _col27, _col39, _col42 Statistics: Num rows: 1814996992 Data size: 14519975936 Basic stats: COMPLETE Column stats: NONE Map Join Operator condition map: Inner Join 0 to 1 condition expressions: 0 {_col0} {_col3} {_col10} {_col11} {_col14} {_col21} {_col23} {_col26} {_col27} {_col39} {_col42} 1 {n_nationkey} {n_name} {n_regionkey} keys: 0 _col42 (type: int) 1 n_nationkey (type: int) outputColumnNames: _col0, _col3, _col10, _col11, _col14, _col21, _col23, _col26, _col27, _col39, _col42, _col48, _col49, _col50 Statistics: Num rows: 1996496768 Data size: 15971974144 Basic stats: COMPLETE Column stats: NONE Map Join Operator condition map: Inner Join 0 to 1 condition expressions: 0 {_col0} {_col3} {_col10} {_col11} {_col14} {_col21} {_col23} {_col26} {_col27} {_col39} {_col42} {_col48} {_col49} {_col50} 1 {r_regionkey} {r_name} keys: 0 _col50 (type: int) 1 r_regionkey (type: int) outputColumnNames: _col0, _col3, _col10, _col11, _col14, _col21, _col23, _col26, _col27, _col39, _col42, _col48, _col49, _col50, _col54, _col55 Statistics: Num rows: 2196146432 Data size: 17569171456 Basic stats: COMPLETE Column stats: NONE Filter Operator predicate: (((((((((((((((_col0 = _col11) and (_col21 = _col10)) and (_col23 = _col39)) and (_col3 = _col42)) and (_col42 = _col48)) and (_col50 = _col54)) and (_col55 = 'ASIA')) and (_col14 >= '1994-01-01')) and (_col14 < '1995-01-01')) and _col21 is not null) and _col0 is not null) and _col23 is not null) and _col3 is not null) and _col42 is not null) and _col50 is not null) (type: boolean) Statistics: Num rows: 29788 Data size: 238304 Basic stats: COMPLETE Column stats: NONE Select Operator expressions: _col49 (type: string), _col26 (type: double), _col27 (type: double) outputColumnNames: _col49, _col26, _col27 Statistics: Num rows: 29788 Data size: 238304 Basic stats: COMPLETE Column stats: NONE Group By Operator aggregations: sum((_col26 * (1 - _col27))) keys: _col49 (type: string) mode: hash outputColumnNames: _col0, _col1 Statistics: Num rows: 29788 Data size: 238304 Basic stats: COMPLETE Column stats: NONE Reduce Output Operator key expressions: _col0 (type: string) sort order: + Map-reduce partition columns: _col0 (type: string) Statistics: Num rows: 29788 Data size: 238304 Basic stats: COMPLETE Column stats: NONE value expressions: _col1 (type: double) Reducer 6 Reduce Operator Tree: Group By Operator aggregations: sum(VALUE._col0) keys: KEY._col0 (type: string) mode: mergepartial outputColumnNames: _col0, _col1 Statistics: Num rows: 14894 Data size: 119152 Basic stats: COMPLETE Column stats: NONE Select Operator expressions: _col0 (type: string), _col1 (type: double) outputColumnNames: _col0, _col1 Statistics: Num rows: 14894 Data size: 119152 Basic stats: COMPLETE Column stats: NONE Reduce Output Operator key expressions: _col1 (type: double) sort order: - Statistics: Num rows: 14894 Data size: 119152 Basic stats: COMPLETE Column stats: NONE value expressions: _col0 (type: string) Reducer 7 Reduce Operator Tree: Select Operator expressions: VALUE._col0 (type: string), KEY.reducesinkkey0 (type: double) outputColumnNames: _col0, _col1 Statistics: Num rows: 14894 Data size: 119152 Basic stats: COMPLETE Column stats: NONE File Output Operator compressed: false Statistics: Num rows: 14894 Data size: 119152 Basic stats: COMPLETE Column stats: NONE table: input format: org.apache.hadoop.mapred.TextInputFormat output format: org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat serde: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe Stage: Stage-0 Fetch Operator limit: -1 Processor Tree: ListSink