Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Duplicate
-
Impala 2.11.0
-
None
-
None
-
None
-
ghx-label-1
Description
For tables with a large # of columns (>400), the metadata and processing time for computing stats on all columns becomes prohibitively expensive. It would increase performance and reduce catalog memory to be able to run stats only on a subset of columns that are frequently accessed.
Attachments
Issue Links
- duplicates
-
IMPALA-3562 Extend "compute stats" syntax to support a list of columns
- Resolved