Details
-
New Feature
-
Status: Closed
-
Major
-
Resolution: Duplicate
-
None
-
None
-
None
-
ghx-label-8
Description
In Hive, we can create bucket tables, divide data in fine-grained ways, and publish data to different files based on bucket columns. Like this, we can make specific optimizations to the Query to speed up the Query.
I think it would be exciting for Impala to have support for bucket table creation and related optimizations.
The following document is a design document that supports the creation of bucket tables. If you are interested, welcome to give some suggestions.
Support Bucketed Table And Related Optimizations