Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
Impala 2.5.0
Description
HdfsScanNode::WaitForPartitionFilters() shouldn't only wait for partition filters, as there is often a lot of value in doing row-based filtering as well.
However, we should also avoid waiting for filters that we can't apply (e.g. if row filtering is turned off, or if the file format is not parquet). The best thing to do is probably to wait separately, in the parquet column reader, since that gives row filters the maximum time to arrive and doesn't affect waiting for partition filters earlier in scanner's pipeline.