[HIVE-5783] Native Parquet Support in Hive - ASF JIRA

XML

Word

Printable

JSON

Details

Type: New Feature
Status: Resolved
Priority: Minor
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 0.13.0
Component/s: Serializers/Deserializers
Labels:
- Parquet

Release Note:
Added support for 'STORED AS PARQUET' and for setting parquet as the default storage engine.

Description

Problem Statement:

Hive would be easier to use if it had native Parquet support. Our organization, Criteo, uses Hive extensively. Therefore we built the Parquet Hive integration and would like to now contribute that integration to Hive.

About Parquet:

Parquet is a columnar storage format for Hadoop and integrates with many Hadoop ecosystem tools such as Thrift, Avro, Hadoop MapReduce, Cascading, Pig, Drill, Crunch, and Hive. Pig, Crunch, and Drill all contain native Parquet integration.

Changes Details:

Parquet was built with dependency management in mind and therefore only a single Parquet jar will be added as a dependency.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

HIVE-5783.noprefix.patch
07/Feb/14 15:39
196 kB
Brock Noland
HIVE-5783.noprefix.patch
06/Feb/14 23:47
196 kB
Brock Noland
HIVE-5783.patch
08/Feb/14 16:43
196 kB
Brock Noland
HIVE-5783.patch
07/Feb/14 15:40
196 kB
Brock Noland
HIVE-5783.patch
06/Feb/14 23:47
196 kB
Brock Noland
HIVE-5783.patch
04/Feb/14 22:35
196 kB
Brock Noland
HIVE-5783.patch
04/Feb/14 20:34
197 kB
Brock Noland
HIVE-5783.patch
04/Feb/14 16:51
195 kB
Brock Noland
HIVE-5783.patch
04/Feb/14 16:25
194 kB
Brock Noland
HIVE-5783.patch
30/Jan/14 14:48
192 kB
Brock Noland
HIVE-5783.patch
28/Jan/14 19:08
172 kB
Brock Noland
HIVE-5783.patch
25/Jan/14 19:16
173 kB
Brock Noland
HIVE-5783.patch
24/Jan/14 17:56
173 kB
Brock Noland
HIVE-5783.patch
24/Jan/14 12:05
180 kB
Justin Coffey
HIVE-5783.patch
21/Jan/14 07:35
199 kB
Brock Noland
HIVE-5783.patch
20/Jan/14 22:22
199 kB
Brock Noland
HIVE-5783.patch
20/Jan/14 18:55
171 kB
Justin Coffey
HIVE-5783.patch
17/Jan/14 19:08
196 kB
Brock Noland
HIVE-5783.patch
06/Dec/13 22:15
9 kB
Xuefu Zhang

Issue Links

blocks

HIVE-6367 Implement Decimal in ParquetSerde

Closed

HIVE-6375 Fix CTAS for parquet

Resolved

HIVE-6366 Refactor some items in Hive Parquet

Open

HIVE-6384 Implement all Hive data types in Parquet

Resolved

is related to

HIVE-25296 Replace parquet-hadoop-bundle dependency with the actual parquet modules

Open

HIVE-5976 Decouple input formats from STORED as keywords

Closed

relates to

HIVE-5998 Add vectorized reader for Parquet files

Closed

HIVE-6368 Document parquet on hive wiki

Resolved

(1 is related to, 2 relates to)

Activity

People

Assignee:: Justin Coffey

Reporter:: Justin Coffey

Votes:: 1 Vote for this issue

Watchers:: 28 Start watching this issue

Dates

Created:: 08/Nov/13 18:40

Updated:: 29/Jun/21 10:14

Resolved:: 09/Feb/14 00:26