HIVE-4244: Make string dictionaries adaptive in ORC

    Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: File Formats
    • Labels: None

      Description

      The ORC writer should adaptively switch between dictionary and direct encoding. I'd propose looking at the first 100,000 values in each column and deciding whether there is sufficient loading in the dictionary to use dictionary encoding.
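      A minimal sketch of what such a check could look like (the class and method names, the sample-size constant, and the threshold parameter are all illustrative assumptions, not the actual ORC writer code):

        import java.util.HashSet;
        import java.util.List;
        import java.util.Set;

        // Hypothetical sketch: sample the first values of a column and
        // decide whether dictionary encoding looks worthwhile.
        public class EncodingChooser {
          private static final int SAMPLE_SIZE = 100_000;

          // Returns true when the fraction of distinct values in the
          // sample is low enough that a dictionary should pay off.
          static boolean useDictionary(List<String> columnValues,
                                       double maxDistinctRatio) {
            int sampled = Math.min(SAMPLE_SIZE, columnValues.size());
            Set<String> distinct = new HashSet<>();
            for (int i = 0; i < sampled; i++) {
              distinct.add(columnValues.get(i));
            }
            double distinctRatio = (double) distinct.size() / sampled;
            return distinctRatio <= maxDistinctRatio;
          }
        }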

        Activity

        Kevin Wilfong added a comment -

        Some initial thoughts based on some experiments.

        Dictionary encoding seems to be less effective than Zlib alone at compressing values when the number of distinct values is more than ~80% of the total number of values. This number can be made configurable. The dictionary is still smaller in memory, so we may be able to get away with keeping it while building the stripe and writing the data out directly when the stripe is written. This should be comparable in performance to the conversion of the dictionary index that is already done.

        Also, if the uncompressed (but encoded) size of the dictionary + index (data stream) is greater than the uncompressed size of the original data, the compressed data tends to be larger as well, despite the sorting. This will be more expensive to figure out, as we don't know the size of the index until it has been run-length encoded.
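        As a rough sketch, the two observations combine into a predicate like the one below (a method-level sketch that could sit alongside the one above; the names and the sizes passed in are assumptions, and the sizes would have to be tracked or estimated by the writer):

          // Hypothetical: prefer direct encoding when either observation
          // above says the dictionary will not pay off.
          static boolean preferDirect(long distinctValues, long totalValues,
                                      long dictionaryBytes, long indexBytes,
                                      long rawDataBytes,
                                      double distinctCutoff) { // e.g. 0.8
            // Observation 1: too many distinct values; Zlib alone wins.
            if ((double) distinctValues / totalValues > distinctCutoff) {
              return true;
            }
            // Observation 2: encoded dictionary + index outgrow the raw
            // data, in which case the compressed result tends to as well.
            return dictionaryBytes + indexBytes > rawDataBytes;
          }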

        Owen O'Malley added a comment -

        We should play with different values, but I was guessing the right cutover point for the heuristic was at a loading of 2 to 3 (50% to 33% distinct values).

        We aren't really going to know whether the heuristic is right or wrong unless we compare both encodings, which is much too expensive. By taking a good guess after looking at the start of the stripe, we can get good performance most of the time.
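        For concreteness, "loading" here is total values divided by distinct values, so a cutover loading of 2 corresponds to 50% distinct values and a loading of 3 to about 33%. A tiny illustrative predicate (not project code):

          // Hypothetical: loading = totalValues / distinctValues.
          static boolean loadingFavorsDictionary(long totalValues,
                                                 long distinctValues,
                                                 double minLoading) { // 2-3
            return (double) totalValues / distinctValues >= minLoading;
          }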


          People

          • Assignee: Kevin Wilfong
          • Reporter: Owen O'Malley
          • Votes: 0
          • Watchers: 2
