Query (id=564d9fe6e4f923db:437b5d9c0d83eea1): Summary: Session ID: 864fd1d0c39c85a7:8f01bacf3058bfb6 Session Type: BEESWAX Start Time: 2015-02-27 18:13:34.593166000 End Time: 2015-02-27 18:39:28.763121000 Query Type: QUERY Query State: FINISHED Query Status: OK Impala Version: impalad version 2.2.0-cdh5-INTERNAL RELEASE (build a208b95604bafa9e6b5dd4167bda4f7554e544a5) User: tbobrovytsky Connected User: tbobrovytsky Delegated User: Network Address: 172.21.0.188:56694 Default Db: tpch300gb_parquet Sql Statement: select LEAST(COALESCE((supplier_no) * (supplier_no), -355), COALESCE(MIN((supplier_no) * (supplier_no)), 137)) AS int_col, COUNT(COALESCE((supplier_no) * (supplier_no), supplier_no)) AS int_col_2, (supplier_no) + (supplier_no) AS int_col_3, (supplier_no) * (supplier_no) AS int_col_4 FROM revenue GROUP BY (supplier_no) + (supplier_no), (supplier_no) * (supplier_no) UNION SELECT (t1.l_suppkey) * (t1.l_quantity) AS decimal_col, t1.l_orderkey, LEAST(COALESCE(MIN(868) OVER (), -233), -755.787102873) AS float_col, COALESCE(SUM(t1.l_suppkey), t1.l_orderkey, LEAST(COALESCE(t1.l_orderkey, 650), COALESCE(t1.l_orderkey, 517))) AS int_col FROM lineitem t1 WHERE (t1.l_suppkey) != (t1.l_tax) GROUP BY (t1.l_suppkey) * (t1.l_quantity), t1.l_orderkey Coordinator: e1118.halxg.cloudera.com:22000 Plan: ---------------- Estimated Per-Host Requirements: Memory=14.75GB VCores=2 F07:PLAN FRAGMENT [UNPARTITIONED] 15:EXCHANGE [UNPARTITIONED] hosts=1 per-host-mem=unavailable tuple-ids=5 row-size=40B cardinality=179998909 F06:PLAN FRAGMENT [HASH(int_col,int_col_2,int_col_3,int_col_4)] DATASTREAM SINK [FRAGMENT=F07, EXCHANGE=15, UNPARTITIONED] 14:AGGREGATE [FINALIZE] | group by: int_col, int_col_2, int_col_3, int_col_4 | hosts=1 per-host-mem=7.38GB | tuple-ids=5 row-size=40B cardinality=179998909 | 13:EXCHANGE [HASH(int_col,int_col_2,int_col_3,int_col_4)] hosts=1 per-host-mem=0B tuple-ids=5 row-size=40B cardinality=179998909 F05:PLAN FRAGMENT [RANDOM] DATASTREAM SINK [FRAGMENT=F06, EXCHANGE=13, HASH(int_col,int_col_2,int_col_3,int_col_4)] 06:AGGREGATE | group by: int_col, int_col_2, int_col_3, int_col_4 | hosts=1 per-host-mem=7.38GB | tuple-ids=5 row-size=40B cardinality=179998909 | 00:UNION | hosts=1 per-host-mem=0B | tuple-ids=5 row-size=40B cardinality=179998909 | |--12:EXCHANGE [RANDOM] | hosts=10 per-host-mem=0B | tuple-ids=3,6 row-size=34B cardinality=179998909 | 08:AGGREGATE [FINALIZE] | output: min:merge((supplier_no) * (supplier_no)), count:merge(coalesce((supplier_no) * (supplier_no), supplier_no)) | group by: (supplier_no) + (supplier_no), (supplier_no) * (supplier_no) | hosts=1 per-host-mem=10.00MB | tuple-ids=1 row-size=32B cardinality=0 | 07:EXCHANGE [HASH((supplier_no) + (supplier_no),(supplier_no) * (supplier_no))] hosts=1 per-host-mem=0B tuple-ids=1 row-size=32B cardinality=0 F04:PLAN FRAGMENT [UNPARTITIONED] DATASTREAM SINK [FRAGMENT=F05, EXCHANGE=12, RANDOM] 05:ANALYTIC | functions: min(868) | hosts=10 per-host-mem=unavailable | tuple-ids=3,6 row-size=34B cardinality=179998909 | 11:EXCHANGE [UNPARTITIONED] hosts=10 per-host-mem=unavailable tuple-ids=3 row-size=32B cardinality=179998909 F03:PLAN FRAGMENT [HASH((t1.l_suppkey) * (t1.l_quantity),t1.l_orderkey)] DATASTREAM SINK [FRAGMENT=F04, EXCHANGE=11, UNPARTITIONED] 10:AGGREGATE [FINALIZE] | output: sum:merge(t1.l_suppkey) | group by: (t1.l_suppkey) * (t1.l_quantity), t1.l_orderkey | hosts=10 per-host-mem=5.90GB | tuple-ids=3 row-size=32B cardinality=179998909 | 09:EXCHANGE [HASH((t1.l_suppkey) * (t1.l_quantity),t1.l_orderkey)] hosts=10 per-host-mem=0B tuple-ids=3 row-size=32B cardinality=179998909 F02:PLAN FRAGMENT [RANDOM] DATASTREAM SINK [FRAGMENT=F03, EXCHANGE=09, HASH((t1.l_suppkey) * (t1.l_quantity),t1.l_orderkey)] 04:AGGREGATE | output: sum(t1.l_suppkey) | group by: (t1.l_suppkey) * (t1.l_quantity), t1.l_orderkey | hosts=10 per-host-mem=5.90GB | tuple-ids=3 row-size=32B cardinality=179998909 | 03:SCAN HDFS [tpch300gb_parquet.lineitem t1, RANDOM] partitions=1/1 files=182 size=64.36GB predicates: (t1.l_suppkey) != (t1.l_tax) table stats: 1799989091 rows total column stats: all hosts=10 per-host-mem=352.00MB tuple-ids=2 row-size=32B cardinality=179998909 F00:PLAN FRAGMENT [RANDOM] DATASTREAM SINK [FRAGMENT=F05, EXCHANGE=07, HASH((supplier_no) + (supplier_no),(supplier_no) * (supplier_no))] 02:AGGREGATE | output: min((supplier_no) * (supplier_no)), count(coalesce((supplier_no) * (supplier_no), supplier_no)) | group by: (supplier_no) + (supplier_no), (supplier_no) * (supplier_no) | hosts=1 per-host-mem=10.00MB | tuple-ids=1 row-size=32B cardinality=0 | 01:SCAN HDFS [tpch300gb_parquet.revenue, RANDOM] partitions=1/1 files=10 size=37.99MB table stats: 0 rows total column stats: all hosts=1 per-host-mem=16.00MB tuple-ids=0 row-size=8B cardinality=0 ---------------- Estimated Per-Host Mem: 15839904256 Estimated Per-Host VCores: 2 Request Pool: root.default