I was wondering whether there is an OutputFormat that can bulk-load data into Cassandra from a Hadoop job written in Python.
I have a very complex Hadoop job written in Python that processes test data and generates structured data in a sequential file. I then read this data and stream it to Cassandra using BulkOutputFormat.
Is there any way to avoid writing to the sequential file and instead process and stream the data directly to Cassandra (with the Hadoop job written in Python)?
What would be a possible solution for this?