Details
- Type: New Feature
- Status: Resolved
- Priority: Major
- Resolution: Fixed
Description
Traditionally, adding new data to Hive requires gathering a large amount of data onto HDFS and then periodically adding a new partition; this is essentially a "batch insertion", and inserting new data into an existing partition is not permitted. The Hive Streaming API instead allows data to be pumped continuously into Hive: incoming data can be committed in small batches of records to an existing Hive partition or table, and once a batch is committed it becomes immediately visible to all subsequently initiated Hive queries.
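For context, a minimal sketch of driving the Hive Streaming API (the org.apache.hive.hcatalog.streaming classes) directly. The metastore URI, database, table, column names, and partition value are placeholders; the target table must be a transactional, bucketed ORC table.

{code:java}
import java.util.Collections;

import org.apache.hive.hcatalog.streaming.DelimitedInputWriter;
import org.apache.hive.hcatalog.streaming.HiveEndPoint;
import org.apache.hive.hcatalog.streaming.StreamingConnection;
import org.apache.hive.hcatalog.streaming.TransactionBatch;

public class HiveStreamingExample {
    public static void main(String[] args) throws Exception {
        // Endpoint for one partition of a transactional, bucketed ORC table.
        // URI, database, table, and partition value are placeholders.
        HiveEndPoint endPoint = new HiveEndPoint(
                "thrift://metastore-host:9083",
                "default", "web_logs",
                Collections.singletonList("2016-08-01"));

        // true => create the partition if it does not exist yet
        StreamingConnection connection = endPoint.newConnection(true);
        DelimitedInputWriter writer =
                new DelimitedInputWriter(new String[] {"id", "msg"}, ",", endPoint);

        // Fetch a batch of transactions; each commit makes its records
        // immediately visible to subsequently issued Hive queries.
        TransactionBatch txnBatch = connection.fetchTransactionBatch(10, writer);
        try {
            txnBatch.beginNextTransaction();
            txnBatch.write("1,hello".getBytes());
            txnBatch.write("2,world".getBytes());
            txnBatch.commit();
        } finally {
            txnBatch.close();
            connection.close();
        }
    }
}
{code}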
This issue adds a PutHiveStreaming processor to NiFi that leverages the Hive Streaming API to allow continuous streaming of data into a Hive partition or table; a skeleton of such a processor is sketched below.
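A minimal skeleton of what such a processor could look like, assuming the standard NiFi AbstractProcessor API. Record parsing and the actual streaming calls are elided, and the relationship name is illustrative rather than the final processor's contract.

{code:java}
import java.util.Collections;
import java.util.Set;

import org.apache.nifi.flowfile.FlowFile;
import org.apache.nifi.processor.AbstractProcessor;
import org.apache.nifi.processor.ProcessContext;
import org.apache.nifi.processor.ProcessSession;
import org.apache.nifi.processor.Relationship;
import org.apache.nifi.processor.exception.ProcessException;

public class PutHiveStreaming extends AbstractProcessor {

    // Illustrative relationship; the final processor may define others
    // (e.g. for failures or retries).
    static final Relationship REL_SUCCESS = new Relationship.Builder()
            .name("success")
            .description("FlowFiles whose records were committed to Hive")
            .build();

    @Override
    public Set<Relationship> getRelationships() {
        return Collections.singleton(REL_SUCCESS);
    }

    @Override
    public void onTrigger(final ProcessContext context, final ProcessSession session)
            throws ProcessException {
        final FlowFile flowFile = session.get();
        if (flowFile == null) {
            return;
        }
        // Hypothetical outline: read records from the FlowFile content and
        // write them through a Hive Streaming TransactionBatch (as in the
        // previous sketch), then route the FlowFile on success.
        session.transfer(flowFile, REL_SUCCESS);
    }
}
{code}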
Issue Links
- supercedes
  - NIFI-2448 Hive Processors depend on too recent a Hive version (Resolved)
- links to