[HIVE-25296] Replace parquet-hadoop-bundle dependency with the actual parquet modules - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Open
Priority: Major
Resolution: Unresolved
Affects Version/s: None
Fix Version/s: None
Component/s: None
Labels:
- pull-request-available

Target Version/s:

4.0.0

Description

The parquet-hadoop-bundle is not a real dependency but a mere packaging
of three parquet modules to create an uber jar. The Parquet community
created this artificial module on demand by ~~HIVE-5783~~ but the
benefits if any are unclear.

On the contrary using the uber dependency has some drawbacks:

Parquet souce code cannot be attached easily in IDEs which makes debugging sessions cumbersome.
Finding concrete dependencies with Parquet is not possible just by inspecting the pom files.
Extra maintenance cost for the Parquet community adding additional verification steps during a release.

The goal of this JIRA is to replace the uber dependency with concrete dependencies to the respective modules:

parquet-common
parquet-column
parquet-hadoop

Attachments

Issue Links

relates to

HIVE-5783 Native Parquet Support in Hive

Resolved

links to

GitHub Pull Request #2288

Activity

People

Assignee:: Stamatis Zampetakis

Reporter:: Stamatis Zampetakis

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 29/Jun/21 10:14

Updated:: 21/Oct/22 07:20

Time Tracking

Estimated:

Not Specified

Remaining:

Logged:

40m