Uploaded image for project: 'Hama'
  1. Hama
  2. HAMA-505 Fault Tolerant Job Processing
  3. HAMA-503

Chainable computations for fault tolerance

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 0.4.0
    • 0.5.0
    • bsp core
    • None

    Description

      refactor bsp() in allowing checkpointed messages to be recovered.

      ChiaHung Lin had a fancy idea in chaining superstep class to make the whole recovering more convenient and less error prone, or at least possible.

      A user does not define a BSP anymore, instead he defines a single superstep inside of a computation class. A user is able to chain these in a specific ordering. After each of this computation the framework calls sync() and exchanges the messages.

      Attachments

        1. HAMA-503.patch
          12 kB
          Thomas Jungblut

        Activity

          People

            thomas.jungblut Thomas Jungblut
            thomas.jungblut Thomas Jungblut
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: