[HIVE-2814] Can we have a feature to disable creating empty buckets when a large number of buckets is created? - ASF JIRA

Details

Type: Bug
Status: Open
Priority: Minor
Resolution: Unresolved
Affects Version/s: None
Fix Version/s: None
Component/s: None
Labels:
- hive
- newbie

Tags:
hive bucketing

Description

When we create buckets on larger datasets, the partitions rarely all have the same number of buckets, so we choose the largest likely number to cover most of them.

This results in a lot of empty buckets being created, which may be an overhead for Hadoop as well as for Hive queries.
It also takes a long time just to create the empty buckets.

Is there a way to specify that empty buckets should not be created?
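
To illustrate the problem described above, here is a minimal Python sketch of how a fixed bucket count can leave most buckets empty for a small partition. It is not Hive code: the function names `bucket_for` and `count_empty_buckets` are hypothetical, and the sketch assumes rows are assigned to buckets by a non-negative hash of the key modulo the bucket count, with an integer key hashing to its own value.

```python
def bucket_for(key: int, num_buckets: int) -> int:
    """Assumed hash-mod assignment: mask the sign bit, then take the modulus."""
    # Assumption: an integer key hashes to its own value.
    return (key & 0x7FFFFFFF) % num_buckets


def count_empty_buckets(keys, num_buckets: int) -> int:
    """Return how many of the num_buckets buckets receive no rows."""
    used = {bucket_for(k, num_buckets) for k in keys}
    return num_buckets - len(used)


# A small partition with only 10 distinct keys, bucketed into 256 buckets
# because 256 was chosen to suit the largest partitions.
print(count_empty_buckets(range(10), 256))    # 246 buckets stay empty

# A larger partition fills every bucket.
print(count_empty_buckets(range(1000), 256))  # 0 buckets stay empty
```

Under these assumptions, each empty bucket in the small partition would still be written out as a file, which is the overhead this issue asks to avoid.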

People

Assignee: Mani M

Reporter: Nitin Pawar

Votes: 0

Watchers: 3

Dates

Created: 22/Feb/12 13:52

Updated: 09/Jan/19 11:06