[S2GRAPH-15] S2Lambda, speed and batch layers of the lambda architecture - ASF JIRA

Details

Type: New Feature
Status: To Do
Priority: Major
Resolution: Unresolved
Affects Version/s: None
Fix Version/s: None
Component/s: None
Labels:
- features

Description

Background

From the perspective of the lambda architecture, S2Graph provides a real-time view through its serving layer on HBase.
The input stream arriving through the REST API is stored in HBase and can be served by graph queries in real time.

The stream, which is a write-ahead log, is also written to Kafka, which allows us to do many other things with it.
Several works (or sub-projects) use this stream:

S2Counter - computes real-time counts for combinations of properties, using the Kafka stream directly.
WalToHdfs - converts the Kafka stream into the incremental view.
S2ML - runs machine learning algorithms on the incremental view.
…
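
To make "count by the combinations of properties" concrete, here is a minimal sketch in plain Python. It is not S2Counter's implementation: the function name, the event shape, and the property names (`country`, `device`) are assumptions made up for illustration, and an in-memory list stands in for the Kafka stream.

```python
from collections import Counter
from itertools import combinations


def count_property_combinations(events, dimensions):
    """Count events for every non-empty combination of the given properties.

    `events` is an iterable of dicts (a stand-in for log records read from
    the stream); `dimensions` lists the hypothetical property names to combine.
    """
    counts = Counter()
    for event in events:
        # Keep only the dimensions that this event actually carries.
        present = [(name, event[name]) for name in dimensions if name in event]
        # Increment one counter per combination of (property, value) pairs.
        for size in range(1, len(present) + 1):
            for combo in combinations(present, size):
                counts[combo] += 1
    return counts


events = [
    {"country": "KR", "device": "mobile"},
    {"country": "KR", "device": "web"},
]
counts = count_property_combinations(events, ["country", "device"])
```

Each event updates one counter per subset of its properties, so the number of counters grows exponentially with the number of dimensions; a real system would presumably restrict which combinations are tracked.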

S2Lambda

Because the works above were developed separately, they use different Spark versions and contain duplicated code.
This makes them hard to build and hinders code reuse.

S2Lambda should be designed to solve this problem by providing a general framework for the speed and batch layers.

IMHO, first, a JSON-formatted job description should be designed to be compatible with both the speed and batch layers;
then S2Lambda is implemented to conform to it.
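
As a rough illustration of the idea, the sketch below shows one job description driving both a batch run and a speed-layer run. Everything in it is hypothetical: the JSON field names (`source`, `processors`, `sink`), the topic name, and the helper functions are assumptions for illustration, not the S2Lambda schema or API. Plain Python lists stand in for Spark and Kafka so the example stays self-contained.

```python
import json

# Hypothetical job description; the schema is an assumption, not S2Lambda's.
JOB_DESCRIPTION = """
{
  "name": "example_job",
  "source": {"type": "kafka", "topic": "example_wal_topic"},
  "processors": [
    {"type": "filter", "field": "operation", "equals": "insert"}
  ],
  "sink": {"type": "stdout"}
}
"""


def make_filter(field, expected):
    """Return a step that keeps records whose `field` equals `expected`."""
    def step(records):
        return (r for r in records if r.get(field) == expected)
    return step


def build_pipeline(job):
    """Translate the 'processors' list into one function over records."""
    steps = []
    for spec in job["processors"]:
        if spec["type"] == "filter":
            steps.append(make_filter(spec["field"], spec["equals"]))
        else:
            raise ValueError("unknown processor type: " + spec["type"])

    def run(records):
        for step in steps:
            records = step(records)
        return records

    return run


def run_batch(job, records):
    """Batch layer: apply the pipeline once to the whole data set."""
    return list(build_pipeline(job)(records))


def run_speed(job, micro_batches):
    """Speed layer: apply the same pipeline to each micro-batch as it arrives."""
    pipeline = build_pipeline(job)
    for batch in micro_batches:
        yield list(pipeline(batch))


job = json.loads(JOB_DESCRIPTION)
records = [
    {"operation": "insert", "id": 1},
    {"operation": "delete", "id": 2},
    {"operation": "insert", "id": 3},
]
batch_result = run_batch(job, records)
speed_result = list(run_speed(job, [records[:2], records[2:]]))
```

The point of the design is that the processing logic is defined once in the description, and only the driver (whole data set versus micro-batches) differs between the two layers.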

Attachments

s2lambda.001.png
04/Jan/16 02:08
121 kB
Min-Seok Kim

Issue Links

links to

GitHub Pull Request #15

People

Assignee: Unassigned

Reporter: Min-Seok Kim

Votes: 0

Watchers: 5

Dates

Created: 03/Jan/16 13:56

Updated: 04/Feb/20 04:07

Time Tracking

Estimated:

Not Specified

Remaining:

Logged:

10m