Details
Description
With the finalized abstraction of the Mahout DSL plans from the backend operations (MAHOUT-1529), it should be possible to integrate further backends for the Mahout DSL. Apache Flink would be a suitable candidate to act as a good execution backend.
With respect to the implementation, the biggest difference between Spark and Flink at the moment is probably the incremental rollout of plans, which is triggered by Spark's actions and which is not supported by Flink yet. However, the Flink community is working on this issue. For the moment, it should be possible to circumvent this problem by writing intermediate results required by an action to HDFS and reading from there.
Attachments
Issue Links
- incorporates
-
MAHOUT-1809 Failing tests in Flink-bindings: dals and dspca
- Closed
-
MAHOUT-1810 Failing test in flink-bindings: A + B Identically partitioned (mapBlock Checkpointing issue)
- Closed
-
MAHOUT-1815 dsqDist(X,Y) and dsqDist(X) failing in flink tests.
- Closed
-
MAHOUT-1811 Fix calculation of second norm of DRM in Flink
- Closed
-
MAHOUT-1812 Implement drmParallelizeEmptyLong(...) in flink Bindings
- Closed
-
MAHOUT-1804 Implement drmParallelizeWithRowLabels(...) in flink
- Closed
-
MAHOUT-1805 Implement allreduceBlock(...) in flink bindings
- Closed
-
MAHOUT-1814 Implement drm2intKeyed in flink bindings
- Closed
-
MAHOUT-1701 Mahout DSL for Flink: implement AtB ABt and AtA operators
- Closed
-
MAHOUT-1702 Mahout DSL for Flink: implement element-wise operators
- Closed
-
MAHOUT-1703 Mahout DSL for Flink: implement cbind and rbind
- Closed
-
MAHOUT-1709 Mahout DSL for Flink: implement slicing
- Closed
-
MAHOUT-1710 Mahout DSL for Flink: implement right in-core matrix multiplication
- Closed
-
MAHOUT-1711 Mahout DSL for Flink: implement broadcasting
- Closed
-
MAHOUT-1712 Mahout DSL for Flink: implement operators At, Ax, Atx
- Closed
-
MAHOUT-1734 Mahout DSL for Flink: implement I/O
- Closed
-
MAHOUT-1747 Mahout DSL for Flink: add support for different types of indexes (String, long, etc)
- Closed
-
MAHOUT-1748 Mahout DSL for Flink: switch to Flink Scala API
- Closed
-
MAHOUT-1755 Mahout DSL for Flink: Flush intermediate results to FS
- Closed
-
MAHOUT-1764 Mahout DSL for Flink: Add standard backend tests for Flink
- Closed
-
MAHOUT-1765 Mahout DSL for Flink: Add some documentation about Flink backend
- Closed
- relates to
-
FLINK-393 Create Stratosphere Machine Learning Library
- Resolved