Description
If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
https://cwiki.apache.org/Hive/listbucketing.html
This jira issue will track DML change for the feature:
1. single skewed column
2. manual load data
Attachments
Attachments
Issue Links
- is related to
-
HIVE-3734 Static partition DML create duplicate files and records
- Resolved
-
HIVE-3451 map-reduce jobs does not work for a partition containing sub-directories
- Closed
-
HIVE-3026 List Bucketing in Hive
- Resolved
-
HIVE-3072 Hive List Bucketing - DDL support
- Closed
-
HIVE-3649 Hive List Bucketing - enhance DDL to specify list bucketing table
- Closed
-
HIVE-3601 Hive List Bucketing - add Skewed Information to "explain extended insert overwrite"
- Open
-
HIVE-3650 Hive List Bucketing - validation
- Open
- relates to
-
HIVE-3554 Hive List Bucketing - Query logic
- Closed