Details
Type: Task
Status: Resolved
Priority: Major
Resolution: Fixed
Kudu_Impala
Description
Impala relies on bloom filters to reduce the number of rows coming out of the scan node for selective joins.
Queries get up to a 20x speedup, so lacking bloom filter support in Kudu will create a big performance gap between Parquet and Kudu.
https://github.com/cloudera/Impala/blob/cdh5-trunk/be/src/util/bloom-filter.h
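To illustrate why a bloom filter pushed into the scan helps selective joins, here is a minimal sketch in Python. It is not Impala's implementation (that lives in the bloom-filter.h linked above) and it does not use any Kudu API; the class `BloomFilter` and the helpers `build_filter` and `scan_with_filter` are hypothetical names chosen for this example, and the sizes (1024 bits, 3 hashes) are arbitrary assumptions.

```python
import hashlib


class BloomFilter:
    """Toy Bloom filter: never a false negative, occasionally a false positive."""

    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, key):
        # Derive num_hashes bit positions from salted SHA-256 digests.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{key}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def insert(self, key):
        for pos in self._positions(key):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def may_contain(self, key):
        return all(self.bits[pos // 8] & (1 << (pos % 8))
                   for pos in self._positions(key))


def build_filter(build_keys):
    """Join build side: record every join key seen on the small input."""
    bf = BloomFilter()
    for key in build_keys:
        bf.insert(key)
    return bf


def scan_with_filter(rows, key_of, bf):
    """Scan side: drop rows whose join key is definitely absent from the build side."""
    return [row for row in rows if bf.may_contain(key_of(row))]


# Hypothetical data: a selective join where only 3 of 1000 keys can match.
build_keys = [3, 7, 42]
probe_rows = [(k, f"row{k}") for k in range(1000)]

bf = build_filter(build_keys)
survivors = scan_with_filter(probe_rows, lambda row: row[0], bf)
```

The rows in `survivors` still go through the real join, so a false positive only costs a little extra work and never produces a wrong result. The benefit comes from discarding most non-matching rows at the scan, before they reach the join.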
Issue Links
- is related to
  - IMPALA-9691 Support Kudu Timestamp and Date Bloom Filter (Resolved)
  - IMPALA-4252 Add RuntimeFilters for min/max at KuduScanNode (Resolved)
- relates to
  - KUDU-2483 Scan tablets with bloom filter (Resolved)
  - IMPALA-9789 Disable ineffective bloom filters for Kudu scan (Resolved)