REEF (Retired) / REEF-976

Fix broken C# Tests caused by race condition of local RM


Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.15
    • Component/s: REEF-Tests, REEF.NET

    Description

      There is a race condition in REEF-Local-Runtime, and it can happen as follows:

      1. The Evaluator sends the DONE message and exits its process.
      2. The RM (resource manager) detects that the Evaluator process has ended and sends its own DONE message to the Driver.
      3. The Driver receives the DONE message from the RM before it reads the Evaluator's DONE message, which is still waiting in its network queue.
      4. The Driver calls FailedEvaluatorHandler, even though the Evaluator shut down properly.
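
The ordering problem in the steps above can be illustrated with a small model. This is only a sketch, not REEF code: the `Source` enum, the `classify` function, and the state strings are hypothetical names, and the Driver's bookkeeping is assumed to be a single state that the first DONE message to be handled decides.

```python
from enum import Enum


class Source(Enum):
    """Hypothetical labels for who sent a DONE message to the Driver."""
    EVALUATOR = "evaluator"  # DONE sent by the Evaluator over the network
    RM = "rm"                # DONE sent by the local resource manager


def classify(arrival_order):
    """Return the Driver's final view of the Evaluator, given the order in
    which the two DONE messages are handled (assumed Driver logic)."""
    state = "running"
    for source in arrival_order:
        if state != "running":
            continue  # the first DONE handled already decided the outcome
        if source is Source.EVALUATOR:
            state = "completed"
        elif source is Source.RM:
            # The RM only reports that the process is gone. Without the
            # Evaluator's own DONE, the Driver assumes a crash.
            state = "failed"
    return state


print(classify([Source.EVALUATOR, Source.RM]))  # completed
print(classify([Source.RM, Source.EVALUATOR]))  # failed
```

Both runs describe the same clean shutdown; only the handling order differs, which is what makes the affected tests fail intermittently.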

      This can be fixed by requiring an ACK from the Driver prior to letting the Evaluator exit its process.
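
A minimal sketch of such an ACK handshake, using threads in place of processes. It is an illustration under assumptions, not the REEF implementation: `run_with_ack`, the two queues, the message strings, and the 5-second timeout are all hypothetical. The Driver here deliberately checks the RM channel first, which is the worst case for the race.

```python
import queue
import threading


def run_with_ack():
    network_queue = queue.Queue()  # Evaluator -> Driver messages
    rm_channel = queue.Queue()     # RM -> Driver messages
    ack = threading.Event()        # set once the Driver has handled DONE
    outcome = []

    def evaluator():
        network_queue.put("DONE")
        # Wait for the Driver's ACK before exiting. The timeout is an
        # assumption so that a dead Driver cannot block shutdown forever.
        ack.wait(timeout=5)
        # The "process exit" happens here; the RM notices it and reports.
        rm_channel.put("DONE")

    def driver():
        state = "running"
        while True:
            try:
                rm_channel.get_nowait()  # RM channel is checked first
            except queue.Empty:
                try:
                    network_queue.get(timeout=0.01)
                except queue.Empty:
                    continue
                state = "completed"
                ack.set()  # acknowledge, letting the Evaluator exit
                continue
            if state == "running":
                state = "failed"  # RM reported exit before any DONE was seen
            break
        outcome.append(state)

    threads = [threading.Thread(target=evaluator),
               threading.Thread(target=driver)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return outcome[0]


print(run_with_ack())  # completed
```

Because the RM message cannot exist until the ACK has been sent, the Driver always handles the Evaluator's DONE first, regardless of which channel it polls.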


            People

              afchung90 Andrew Chung