Details
-
Improvement
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
-
None
Description
With the ability to grab tweets and process them scalable w/ SparkStreaming, we now should get a persistence layer - so that we can query data after it is ingested.
I can create a sink interfaces w/ a few options (solr,cassandra,...) for local processing, and then we can refactor the CTakes portion of the pipeline to run asynchronously to ingest.
Attachments
Issue Links
- relates to
-
CTAKES-314 BigTop/Hadoop cTAKES integration
- Open