REEF-976

Fix broken C# Tests caused by race condition of local RM

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.15
    • Component/s: REEF-Tests, REEF.NET
    • Labels:

      Description

      There is a race condition in REEF-Local-Runtime, and it can happen as follows:

      1. The Evaluator sends its DONE message to the Driver and exits its process.
      2. The RM (resource manager) detects that the Evaluator process has ended and sends its own DONE message to the Driver.
      3. The Driver receives the DONE message from the RM before it reads the Evaluator's DONE message, which is still waiting in its network queue.
      4. The Driver calls FailedEvaluatorHandler, even though the Evaluator shut down properly.

      This can be fixed by requiring an ACK from the Driver prior to letting the Evaluator exit its process.
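
      The ordering problem and the proposed ACK can be illustrated with a small, single-threaded simulation. This is only a sketch in Python, not REEF code: the `Driver` class, `drain_network_queue`, `on_rm_done`, and `run` are hypothetical names invented for illustration, and the two message sources are modelled as a plain queue and a direct call rather than real network or RM channels.

```python
from collections import deque


class Driver:
    """Toy driver that receives messages from two independent sources."""

    def __init__(self):
        self.network_queue = deque()  # messages sent by Evaluators
        self.completed = set()        # Evaluators whose DONE was processed
        self.events = []              # handler invocations, in order

    def drain_network_queue(self):
        """Process pending Evaluator messages; return the ids to ACK."""
        acked = []
        while self.network_queue:
            kind, evaluator_id = self.network_queue.popleft()
            if kind == "DONE":
                self.completed.add(evaluator_id)
                self.events.append(("completed", evaluator_id))
                acked.append(evaluator_id)
        return acked

    def on_rm_done(self, evaluator_id):
        """Called when the RM reports that the Evaluator process ended."""
        if evaluator_id not in self.completed:
            # No DONE from the Evaluator has been processed yet,
            # so the exit looks like a failure.
            self.events.append(("failed", evaluator_id))


def run(require_ack):
    """Replay steps 1-4, optionally with the ACK handshake (hypothetical)."""
    driver = Driver()
    evaluator_id = "evaluator-1"

    # Step 1: the Evaluator sends DONE; it sits in the network queue.
    driver.network_queue.append(("DONE", evaluator_id))

    if require_ack:
        # Proposed fix: the Evaluator stays alive until the Driver ACKs,
        # which can only happen after the Driver has processed its DONE.
        acked = driver.drain_network_queue()
        assert evaluator_id in acked

    # Steps 2-3: the process exits and the RM's DONE reaches the Driver.
    driver.on_rm_done(evaluator_id)

    # The Driver eventually reads whatever is left in the network queue.
    driver.drain_network_queue()
    return driver.events


if __name__ == "__main__":
    print("without ACK:", run(require_ack=False))
    print("with ACK:   ", run(require_ack=True))
```

      In this model, the run without the ACK records a `failed` event before the `completed` event, while the run with the ACK records only `completed`, because the Evaluator cannot exit (and so the RM cannot report it) until the Driver has already processed its DONE message.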

              People

              • Assignee: Andrew Chung (afchung90)
              • Reporter: Andrew Chung (afchung90)
