Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
-
None
Description
Avoid paying the perf-overhead of collecting ColumnAccessInfo via the HiveRelFieldTrimmer when it is not necessary.
Currently, we only use the ColumnAccessInfo when we need to perform column based authorization or when it is requested explicitly by the user via the hive.stats.collect.scancols property.
The field trimmer is an expensive operation and can become pretty slow for certain use-cases/queries so skipping some calls when it is not necessary is in general beneficial and can notably improve the compilation time for some queries.
Attachments
Issue Links
- relates to
-
HIVE-28582 OOM when compiling query with many GROUP BY columns aliased multiple times
- Open
- links to