[HIVE-539] Support range bucketing of hive tables/partitions - ASF JIRA

XML

Word

Printable

JSON

Details

Type: New Feature
Status: Open
Priority: Major
Resolution: Unresolved
Affects Version/s: 0.4.0
Fix Version/s: None
Component/s: Metastore, Query Processor
Labels:
None

Description

Hive uses hash partitioner to distribute keys to reducers and thus creating hash bucketed tables/partitions. There are some cases where range partitioning will help in further query processing such as joins/filters.

Terasort (http://hadoop.apache.org/core/docs/current/api/org/apache/hadoop/examples/terasort/package-summary.html) seems to have implemented a sampling based range partitioner and Hive can reuse this or implement something similar.

Attachments

Activity

People

Assignee:: Unassigned

Reporter:: Prasad Chakka

Votes:: 0 Vote for this issue

Watchers:: 5 Start watching this issue

Dates

Created:: 03/Jun/09 20:53

Updated:: 03/Jun/09 20:53