Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Fix Version/s: 0.7 beta 2
    • Component/s: None
    • Labels: None

Description

Hadoop Streaming is a framework that allows MapReduce jobs to be written in languages other than Java, by performing simple IPC over stdin/stdout.

Adding output support for Hadoop Streaming to Cassandra would mean that users could write very simple scripts in dynamic languages to load data into Cassandra. Once our Hadoop OutputFormat has stabilized a bit, we might also be able to use this code to provide scalable bulk loading.
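For illustration: a streaming mapper and reducer are just executables that read lines on stdin and write tab-separated key/value pairs on stdout. A minimal word-count sketch in Python (script names and the job invocation are illustrative, not from the patch):

```python
def map_words(lines):
    # Mapper: emit one "word<TAB>1" pair per word (Hadoop Streaming's
    # default key/value separator is a tab).
    for line in lines:
        for word in line.split():
            yield "%s\t1" % word

def reduce_counts(pairs):
    # Reducer: sum counts per word. Hadoop delivers pairs sorted by key,
    # so all occurrences of a word arrive together.
    current, total = None, 0
    for pair in pairs:
        word, count = pair.rstrip("\n").split("\t")
        if word != current:
            if current is not None:
                yield "%s\t%d" % (current, total)
            current, total = word, 0
        total += int(count)
    if current is not None:
        yield "%s\t%d" % (current, total)

# As mapper.py/reducer.py these generators would be driven by sys.stdin
# and print(), and submitted with something like:
#   hadoop jar hadoop-streaming.jar -mapper mapper.py -reducer reducer.py \
#     -input ... -output ...
```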

Issue Links

Activity

          Stu Hood added a comment -

          0001 - The Hadoop version available directly from Apache is missing some streaming patches that allow for binary data (HADOOP-1722)

          0002 - Adds implementations of Hadoop Streaming interfaces that parse incoming binary data

          0003 - Applies the deprecated Hadoop 0.18 'mapred' OutputFormat interface (as opposed to 0.20's 'mapreduce') to our OutputFormat, since Streaming has not been ported yet

          0004 - Adds a word count example in Python using Hadoop Streaming to count words in an input text file, and write them to Cassandra

          Stu Hood added a comment -

          The second half of contrib/hadoop_streaming_output/bin/reducer.py is the part that actually interacts with our OutputFormat: users create 'StreamingMutation' objects, and write them to stdout using the Avro API.
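For a sense of what crosses the pipe: Avro's binary encoding frames a record as its fields concatenated in schema order, with `bytes` length-prefixed and longs zigzag-varint encoded. A stdlib-only sketch of that framing (the field names and layout of `StreamingMutation` here are hypothetical; the real schema ships with the patch):

```python
def zigzag(n):
    # Avro maps signed longs onto unsigned ints: 0,-1,1,-2,... -> 0,1,2,3,...
    return (n << 1) ^ (n >> 63)

def encode_long(n):
    # Variable-length encoding: 7 bits per byte, high bit means "more follows".
    n = zigzag(n)
    out = bytearray()
    while n & ~0x7F:
        out.append((n & 0x7F) | 0x80)
        n >>= 7
    out.append(n)
    return bytes(out)

def encode_bytes(b):
    # Avro 'bytes': varint length prefix, then the raw bytes.
    return encode_long(len(b)) + b

def encode_mutation(key, name, value, timestamp):
    # An Avro record is simply its fields concatenated in schema order.
    # These four fields are a hypothetical StreamingMutation-like layout.
    return (encode_bytes(key) + encode_bytes(name) +
            encode_bytes(value) + encode_long(timestamp))
```

In practice the reducer builds these records with the avro library's DatumWriter against the schema published by the OutputFormat, rather than by hand.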

          Stu Hood added a comment -

          Depends on the Avro changes on 1315.

          Hudson added a comment -

          Integrated in Cassandra #514 (See http://hudson.zones.apache.org/hudson/job/Cassandra/514/)
          optimize [Time|Lexical]UUIDType comparison further. patch by Folke Behrens; reviewed by jbellis for CASSANDRA-1368

          Stu Hood added a comment -

          Rebased for trunk: still applies atop CASSANDRA-1315

          Jonathan Ellis added a comment -

          wouldn't json be an even better fit than thrift or avro?

          Jonathan Ellis added a comment -

          or even simpler: allow specifying separator characters for rows and columns (iianm this is what regular hadoop streaming does)

          Stu Hood added a comment -

          > wouldn't json be an even better fit than thrift or avro?
          Thrift and Avro serialization exist because JSON is not a nice way to deal with tons of data (especially binary data).

          > or even simpler: allow specifying separator characters for rows and columns (iianm this is what regular hadoop streaming does)
          I think you are seriously underestimating the can of worms this would be, and it wouldn't even get you timestamp support.

          Jonathan Ellis added a comment -

          > Thrift and Avro serialization exist because JSON is not a nice way to deal with tons of data (especially binary data).

          We've introduced support for annotating data with types, so you can represent a long as a long and a uuid as a pretty string, instead of everything being opaque binary.

          I worry that the Avro cure is worse than the disease.

          > I think you are seriously underestimating the can of worms this would be, and it wouldn't even get you timestamp support.

          Maybe. But ColumnOrSuperColumn isn't a whole lot better, and has the drawback of inflicting Yet Another Serialization Format on people to learn.

          Stu Hood added a comment -

          > But ColumnOrSuperColumn isn't a whole lot better
          This argument applies equally well to our client API.

          Switching to JSON in our client API would arguably have less effect than switching to JSON here, since client interactions are more frequently bottlenecked by network latency, while a streaming API should always be bottlenecked on throughput. Smaller objects are better in both locations, but gain us more benefit here.

          > and has the drawback of inflicting Yet Another Serialization Format
          Considering that the entire interaction with Avro is ~20 lines of code (most of which is simply creating dictionaries, which you would have to do for JSON serialization anyway), I don't think we're inconveniencing folks.

          Jonathan Ellis added a comment -

          can you add an example using streaming on the input side?

          Stu Hood added a comment -

          > can you add an example using streaming on the input side?
          It's not implemented: streaming.AvroOutputReader only implements outputting to Cassandra from streaming.

          Jonathan Ellis added a comment -

          > If the client might be using an alternate Avro schema, they can specify it using the OUTPUT_SCHEMA_KEY

          Is this likely to come up in practice or can we get rid of it?

          Stu Hood added a comment -

          > Is this likely to come up in practice or can we get rid of it?
          Ack... I don't think it is actually implemented in this patch yet. Without adding it, changing the Avro client API will break Hadoop Streaming clients.

          I should fix that before we commit.

          Jonathan Ellis added a comment -

          but we're okay if we change the avro client api in backwards-compatible ways, right?

          i'd say adding OUTPUT_SCHEMA_KEY belongs in the "if/when it's actually a problem" category

          Jonathan Ellis added a comment -

          Made some minor changes (r/m lib/license file in 01, and r/m some unused variables in 02) but it still needs the build tweaked to work after the src/cassandra.avro change (i'm guessing that's the culprit).

          Stu Hood added a comment -

          Fixed the build problem: ant didn't like ivysettings.xml being loaded explicitly. No other changes from your rendition.

          Stu Hood added a comment -

          Ah humbug... nevermind... it only works directly after an 'ant realclean'.

          Jonathan Ellis added a comment -

          which build.xml is correct?

          Stu Hood added a comment -

          I've rebased this, and I can't get the build to fail anymore... really no clue what was going on.

          Jonathan Ellis added a comment -

          rebased & committed

          Hudson added a comment -

          Integrated in Cassandra #533 (See https://hudson.apache.org/hudson/job/Cassandra/533/)


People

    • Assignee: Stu Hood
    • Reporter: Stu Hood
    • Reviewer: Jonathan Ellis
    • Votes: 0
    • Watchers: 1

Dates

    • Created:
    • Updated:
    • Resolved:
