-
Type:
Improvement
-
Status: Resolved
-
Priority:
Major
-
Resolution: Fixed
-
Affects Version/s: None
-
Fix Version/s: Impala 4.0
-
Component/s: Frontend
-
Labels:
-
Epic Link:
-
Epic Color:ghx-label-10
Currently the number of instances (used for various estimates) is calculated as estimated # hosts * mt_dop. This is suboptimal, because in many cases there isn't enough input data to allow creating that many instances per host.
The per-host estimate for scans actually already handles this correctly by capping the hosts estimate at the number of input splits - we should use the same approach for number of instance estimates.