Details
- Type: Sub-task
- Status: Closed
- Priority: Major
- Resolution: Fixed
- 3.0.0
- None
Description
Group-by keys may mix unique (or primary) keys with regular columns. In such cases the regular columns do not alter the cardinality of the groups, so if they are not referenced later they can be dropped from the group-by keys. Depending on the operator tree, this may mean that those columns are not read from disk at all in the best case. In the worst case, we still avoid shuffling and sorting the regular columns from mapper to reducer, which could be a substantial CPU and network saving.
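To illustrate the idea, here is a minimal Python sketch, not the actual Hive optimizer rule. The helper names `prune_group_keys` and `group_sum`, the sample rows, and the column names are all hypothetical. It assumes `id` is declared as a unique, non-null key, so every other column is determined by it and the grouping result is unchanged once the extra keys are dropped.

```python
from collections import defaultdict


def prune_group_keys(group_keys, unique_keys, referenced_later=()):
    """Return group_keys without redundant regular columns.

    If some unique key is fully contained in group_keys, every other
    grouping column is redundant and is dropped unless a later operator
    still references it. Otherwise the keys are returned unchanged.
    """
    for unique_key in unique_keys:
        if set(unique_key) <= set(group_keys):
            keep = set(unique_key) | set(referenced_later)
            return [k for k in group_keys if k in keep]
    return list(group_keys)


def group_sum(rows, keys, value):
    """Sum `value` per group, where a group is the tuple of `keys` values."""
    totals = defaultdict(int)
    for row in rows:
        totals[tuple(row[k] for k in keys)] += row[value]
    return totals


# Hypothetical table contents; `id` is assumed to be the primary key.
rows = [
    {"id": 1, "name": "a", "city": "x", "amount": 10},
    {"id": 2, "name": "a", "city": "x", "amount": 20},
    {"id": 3, "name": "b", "city": "y", "amount": 30},
]

full_keys = ["id", "name", "city"]
pruned_keys = prune_group_keys(full_keys, unique_keys=[("id",)])

full = group_sum(rows, full_keys, "amount")
pruned = group_sum(rows, pruned_keys, "amount")
```

With these rows, grouping on the pruned keys produces the same number of groups and the same per-group sums as grouping on all three columns, while `name` and `city` no longer need to be carried through the shuffle.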
Attachments
Issue Links
- duplicates
- HIVE-17041 Aggregate elimination with UNIQUE and NOT NULL column (Resolved)
- HIVE-17042 Expose NOT NULL constraint in optimizer so constant folding can take advantage of it (Resolved)