Optimization in Hive is currently purely rule-based: it applies a fixed set of rules to the plan tree. This depends on hints given by the user (which may or may not be correct) and can result in the execution of costlier plans. This JIRA aims to build a cost model that gives a good estimate of the cost of the various candidate plans beforehand (using metadata that has already been collected), so that we can choose the plan with the lowest estimated cost.
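To make the idea concrete, here is a minimal sketch of cost-based plan selection for joins. It is not Hive's implementation: the table names, row counts, selectivities, and the cost formula (sum of estimated intermediate result sizes over left-deep join orders) are all assumptions chosen for illustration.

```python
from itertools import permutations
from math import prod

# Hypothetical per-table row counts; stands in for metadata collected ahead of time.
ROW_COUNTS = {"orders": 1_000_000, "customers": 50_000, "regions": 100}

# Assumed join selectivities: the fraction of the cross product that survives the join.
SELECTIVITY = {
    frozenset(["orders", "customers"]): 1 / 50_000,
    frozenset(["customers", "regions"]): 1 / 100,
}
NO_PREDICATE = 1.0  # tables with no join predicate form a cross product


def estimate_cost(order):
    """Estimate a left-deep join order as the sum of intermediate result sizes."""
    joined = [order[0]]
    rows = ROW_COUNTS[order[0]]
    cost = 0.0
    for table in order[1:]:
        # Apply every predicate that links the new table to an already joined one.
        sel = prod(
            SELECTIVITY.get(frozenset([t, table]), NO_PREDICATE) for t in joined
        )
        rows = rows * ROW_COUNTS[table] * sel
        cost += rows
        joined.append(table)
    return cost


def choose_best_plan(tables):
    """Enumerate all left-deep join orders and return the cheapest one."""
    return min(permutations(tables), key=estimate_cost)


if __name__ == "__main__":
    best = choose_best_plan(list(ROW_COUNTS))
    print(best, estimate_cost(best))
```

Exhaustive enumeration is only practical for a handful of tables; a real optimizer would prune the search space. The point of the sketch is that the choice is driven by estimated cost derived from statistics rather than by user hints.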
|Field||Original Value||New Value|
|Summary||Cost Based Query optimization in Hive||Cost Based Query optimization for Joins in Hive|
|Assignee|| ||bharath v [ bharathv ]|
|Component/s|| ||Statistics [ 12314312 ]|