Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
1.3.0
-
None
Description
PeriodicGraphCheckpointer was introduced for Latent Dirichlet Allocation (LDA), but it could be generalized to work with both Graphs and RDDs. It should be generalized and moved out of MLlib.
(For those who are not familiar with it, it tries to automatically handle persisting/unpersisting and checkpointing/removing checkpoint files in a lineage of Graphs.)
A generalized version might be immediately useful for:
- RandomForest
- Streaming
- GLMs
Attachments
Issue Links
- relates to
-
SPARK-5484 Pregel should checkpoint periodically to avoid StackOverflowError
- Resolved
-
SPARK-5955 Add checkpointInterval to ALS
- Resolved
-
SPARK-6684 Add checkpointing to GradientBoostedTrees
- Resolved
- links to