Description
I am trying to use BulkOutputFormat to stream data from the map phase of a Hadoop job. I have set the Cassandra-related configuration using ConfigHelper. I have also looked at the Cassandra code, and it appears to be written so that it does not look for the cassandra.yaml file in this case.
However, when I run the job I still get the following error:
{
12/01/13 11:30:04 WARN mapred.JobClient: Use GenericOptionsParser for parsing the arguments. Applications should implement Tool for the same.
12/01/13 11:30:04 INFO input.FileInputFormat: Total input paths to process : 1
12/01/13 11:30:04 INFO mapred.JobClient: Running job: job_201201130910_0015
12/01/13 11:30:05 INFO mapred.JobClient: map 0% reduce 0%
12/01/13 11:30:23 INFO mapred.JobClient: Task Id : attempt_201201130910_0015_m_000000_0, Status : FAILED
java.lang.Throwable: Child Error
at org.apache.hadoop.mapred.TaskRunner.run(TaskRunner.java:271)
Caused by: java.io.IOException: Task process exit with nonzero status of 1.
at org.apache.hadoop.mapred.TaskRunner.run(TaskRunner.java:258)
attempt_201201130910_0015_m_000000_0: Cannot locate cassandra.yaml
attempt_201201130910_0015_m_000000_0: Fatal configuration error; unable to start server.
}
Also, please let me know how I can make the cassandra.yaml file available to the Hadoop MapReduce job.
Attachments
Issue Links
- blocks: CASSANDRA-3907 Support compression using BulkWriter (Resolved)
- is blocked by: CASSANDRA-3752 bulk loader no longer finds sstables (Resolved)
- relates to: CASSANDRA-3839 BulkOutputFormat binds to wrong client address when client is Dual-stack and server is IPv6 (Resolved)
- supersedes: CASSANDRA-3839 BulkOutputFormat binds to wrong client address when client is Dual-stack and server is IPv6 (Resolved)