[CTAKES-331] Add persistence layer to SparkStreaming - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Open
Priority: Major
Resolution: Unresolved
Affects Version/s: None
Fix Version/s: None
Component/s: ctakes-clinical-pipeline
Labels:
None

Description

With the ability to grab tweets and process them scalable w/ SparkStreaming, we now should get a persistence layer - so that we can query data after it is ingested.

I can create a sink interfaces w/ a few options (solr,cassandra,...) for local processing, and then we can refactor the CTakes portion of the pipeline to run asynchronously to ingest.

Attachments

Issue Links

relates to

CTAKES-314 BigTop/Hadoop cTAKES integration

Open

Activity

People

Assignee:: Unassigned

Reporter:: jay vyas

Votes:: 0 Vote for this issue

Watchers:: 1 Start watching this issue

Dates

Created:: 11/Nov/14 01:44

Updated:: 17/Nov/14 14:49