Details
Type: Bug
Status: Open
Priority: Major
Resolution: Unresolved
Description
In this talk, the main problem the speaker encountered was that containers were killed after a network partition. It's not clear which scheduler was being used, but this would not have happened if the framework had enabled checkpointing.
Consider enabling framework checkpointing by default.