Details
- New Feature
- Status: Resolved
- Normal
- Resolution: Fixed
- None
- None
Description
Hadoop Streaming is a framework that allows MapReduce jobs to be written in languages other than Java, by performing simple IPC over stdin/stdout.
Adding output support for Hadoop Streaming to Cassandra would mean that users could write very simple scripts in dynamic languages to load data into Cassandra. Once our Hadoop OutputFormat has stabilized a bit, we might also be able to use this code to provide scalable bulk loading.
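To illustrate the kind of "very simple script" meant here, the following is a minimal sketch of a Hadoop Streaming mapper in Python. It only shows the generic streaming contract (read lines from stdin, write tab-separated key/value lines to stdout). The input layout (`row_key,column,value`) and the emitted record shape (`row_key<TAB>column=value`) are assumptions made up for this example; the format that the Cassandra output side would actually expect is not specified in this ticket.

```python
import sys


def to_record(line):
    """Turn one 'row_key,column,value' line into a tab-separated record.

    The input layout and the 'column=value' encoding are hypothetical,
    chosen only to illustrate the stdin/stdout contract.
    Returns None for lines that do not have exactly three fields.
    """
    parts = line.rstrip("\n").split(",", 2)
    if len(parts) != 3:
        return None
    row_key, column, value = parts
    # Hadoop Streaming treats the text before the first tab as the key.
    return f"{row_key}\t{column}={value}"


def run(lines):
    """Yield one output record per well-formed input line."""
    for line in lines:
        record = to_record(line)
        if record is not None:
            yield record


def main():
    # Streaming mappers read raw lines on stdin and emit records on stdout.
    for record in run(sys.stdin):
        sys.stdout.write(record + "\n")
```

To use this as a mapper, call `main()` at the bottom of the script and pass the file to the streaming job through its `-mapper` option. Keeping the parsing in `run` separate from the I/O in `main` makes the logic easy to test without a Hadoop cluster.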
Attachments
Issue Links
- blocks
  - CASSANDRA-1434 ColumnFamilyOutputFormat performs blocking writes for large batches (Resolved)
- is blocked by
  - CASSANDRA-1315 ColumnFamilyOutputFormat should use client API objects (Resolved)
- relates to
  - CASSANDRA-1497 Add input support for Hadoop Streaming (Resolved)