Details
-
Umbrella
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
-
Don't Know (Unsure) - The default level
Description
Currently OODT excels at managing data, however; it is not as capable of processing really large data sets. The major drawbacks of OODT are: file-based data storage, and filesystem based io. Both of these drawbacks can be addressed by combining OODT with new stream-processing and cluster management technologies.
This effort is currently focused on combining OODT with the Berkley Data Analysis Stack to achieve these exact results.
Initial designs are captured in the attached slides. Focus on "Track 2" as this was decided to be the best approach.
Attachments
Attachments
1.
|
Create Mesos Backend for Resource Manager | Resolved | Michael Starch | ||||||||
2.
|
Create a Spark Runner for the Wengine | Closed | Unassigned | ||||||||
3.
|
Create Streaming Data Repository for Streaming OODT | Resolved | Michael Starch | ||||||||
4.
|
Allow Spark Jobs to Run in Resource Manager | Resolved | Michael Starch | ||||||||
5.
|
Create a Resource Manager Backend That Muliplexes | Closed | Unassigned | ||||||||
6.
|
Move Streaming Components to Seperate Top-Level Component | Resolved | Michael Starch |
|
|||||||
7.
|
Create Deployment Scripts Using Python Fabric for Streaming Components | Resolved | Michael Starch |
|