[DRILL-3333] Add support for auto-partitioning in parquet writer - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 1.1.0
Component/s: None
Labels:
None

Description

When a table is created with a partition by clause, the parquet writer will create separate files for the different partition values. The data will first be sorted by the partition keys, and the parquet writer will create new file when it encounters a new value for the partition columns.

When data is queried against the data that was created this way, partition pruning will work if the filter contains a partition column. And unlike directory based partitioning, no view is required, nor is it necessary to reference the dir* column names.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

DRILL-3333.patch
22/Jun/15 17:38
65 kB
Steven Phillips
DRILL-3333.patch
22/Jun/15 19:06
57 kB
Steven Phillips
DRILL-3333_2015-06-22_15:22:11.patch
22/Jun/15 22:22
57 kB
Steven Phillips
DRILL-3333_2015-06-23_17:38:32.patch
24/Jun/15 00:39
85 kB
Steven Phillips
DRILL-3333_tests.patch
25/Jun/15 20:50
10 kB
Steven Phillips

Activity

People

Assignee:: Steven Phillips

Reporter:: Steven Phillips

Votes:: 0 Vote for this issue

Watchers:: 5 Start watching this issue

Dates

Created:: 22/Jun/15 17:32

Updated:: 18/Sep/15 01:44

Resolved:: 30/Jun/15 21:08