[HIVE-16295] Add support for using Hadoop's S3A OutputCommitter - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Sub-task
Status: Patch Available
Priority: Major
Resolution: Unresolved
Affects Version/s: None
Fix Version/s: None
Component/s: None
Labels:
None

Target Version/s:

3.1.0

Description

Hive doesn't have integration with Hadoop's OutputCommitter, it uses a NullOutputCommitter and uses its own commit logic spread across FileSinkOperator, MoveTask, and Hive.

The Hadoop community is building an OutputCommitter that integrates with S3Guard and does a safe, coordinate commit of data on S3 inside individual tasks (~~HADOOP-13786~~). If Hive can integrate with this new OutputCommitter there would be a lot of benefits to Hive-on-S3:

Data is only written once; directly committing data at a task level means no renames are necessary
The commit is done safely, in a coordinated manner; duplicate tasks (from task retries or speculative execution) should not step on each other

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

HIVE-16295.1.WIP.patch
23/Apr/18 22:00
44 kB
Sahil Takiar
HIVE-16295.2.WIP.patch
24/Apr/18 12:46
44 kB
Sahil Takiar
HIVE-16295.3.WIP.patch
01/May/18 20:56
45 kB
Sahil Takiar
HIVE-16295.4.patch
05/May/18 01:22
110 kB
Sahil Takiar
HIVE-16295.5.patch
17/May/18 21:28
118 kB
Sahil Takiar
HIVE-16295.6.patch
30/May/18 22:28
124 kB
Sahil Takiar
HIVE-16295.7.patch
05/Jun/18 14:40
124 kB
Sahil Takiar
HIVE-16295.8.patch
10/Jul/18 23:12
133 kB
Sahil Takiar
HIVE-16295.9.patch
24/Jul/18 18:43
134 kB
Sahil Takiar

Issue Links

is blocked by

HIVE-19217 Upgrade to Hadoop 3.1.0

Closed

HIVE-18319 Upgrade to Hadoop 3.0.0

Closed

is related to

HADOOP-15421 Stabilise/formalise the JSON _SUCCESS format used in the S3A committers

Resolved

HADOOP-13786 Add S3A committers for zero-rename commits to S3 endpoints

Resolved

relates to

HIVE-19321 Dynamic Partitioning Integration with Hadoop's S3A OutputCommitter

Open

links to

(1 links to)

Activity

People

Assignee:: Unassigned

Reporter:: Sahil Takiar

Votes:: 0 Vote for this issue

Watchers:: 18 Start watching this issue

Dates

Created:: 24/Mar/17 20:20

Updated:: 03/Jan/20 17:11