Uploaded image for project: 'REEF'
  1. REEF
  2. REEF-976

Fix broken C# Tests caused by race condition of local RM

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.15
    • Component/s: REEF-Tests, REEF.NET
    • Labels:

      Description

      There is a race condition in REEF-Local-Runtime, and it can happen as follows:

      1. The Evaluator sends the DONE message and exits its process.
      2. The RM discovers Evaluator ends, sends DONE message to Driver.
      3. Driver first gets DONE message from RM before getting reading the DONE message from the Evaluator in its network queue.
      4. Driver calls FailedEvaluatorHandler, even though the Evaluator shuts down properly.

      This can be fixed by requiring an ACK from the Driver prior to letting the Evaluator exit its process.

        Attachments

        Issue Links

          Activity

            People

            • Assignee:
              afchung90 Andrew Chung
              Reporter:
              afchung90 Andrew Chung

              Dates

              • Created:
                Updated:
                Resolved:

                Issue deployment