Details
-
Bug
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
Impala 2.6.0, Impala 2.7.0, Impala 2.8.0
Description
Looks like to_date() pays a steep penalty when converting timestamps.
+-----------------------------------------------------------------------------------------+ | version() | +-----------------------------------------------------------------------------------------+ | impalad version 2.6.0-cdh5.8.0 RELEASE (build 5464d1750381b40a7e7163b12b09f11b891b4de3) | | Built on Thu, 16 Jun 2016 12:43:48 PST | +-----------------------------------------------------------------------------------------+
-- single column timestamp parquet table of 100,000,000 rows select l_shipdate, count(*) from ts1 group by 1; Fetched 2526 row(s) in 11.25s
select
trunc(l_shipdate,'DD'),
count(*)
from ts1
group by 1;
Fetched 2526 row(s) in 10.74s
select to_date(l_shipdate), count(*) from ts1 group by 1; Fetched 2526 row(s) in 102.36s <<< ~10x slower
Attachments
Attachments
Issue Links
- duplicates
-
IMPALA-1703 Date/Time functions are very slow
- Resolved
- is duplicated by
-
IMPALA-3862 to_date() function used in analytic functions has serious performance impact
- Resolved
- is related to
-
IMPALA-5298 Investigate and fix from_unixtime() slowness, currently 100x slower than scanning Bigint
- Resolved