REEF (Retired) / REEF-1677

IMRU: Evaluators that fail during the WaitingForEvaluator phase don't count towards the MaximumNumberOfEvaluatorFailures limit

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 0.16
    • IMRU

    Description

      In IMRUDriver, we keep a list of failed evaluator IDs and use its length to determine how many evaluators have failed. However, if an evaluator fails in the WaitingForEvaluator state, we immediately remove it from this list and request another evaluator, effectively forgetting about the failure. Thus, no matter how many evaluators fail at this stage, we never hit the MaximumNumberOfEvaluatorFailures limit and keep requesting new evaluators indefinitely.
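
      The behaviour described above can be modelled with a small sketch. This is not the IMRUDriver code: the class, its method names, and the `>=` comparison against the limit are hypothetical and only illustrate how removing the ID right after recording it hides the failure from the limit check.

```python
class ListBasedTracker:
    """Hypothetical model of list-based failure bookkeeping (not REEF code)."""

    def __init__(self, max_failures):
        self.max_failures = max_failures
        self.failed_ids = []

    def on_failure(self, evaluator_id, waiting_for_evaluator):
        self.failed_ids.append(evaluator_id)
        if waiting_for_evaluator:
            # The ID is dropped as soon as a replacement is requested,
            # so this failure no longer contributes to the list length.
            self.failed_ids.remove(evaluator_id)

    def limit_reached(self):
        # Assumption: the limit is compared against the list length.
        return len(self.failed_ids) >= self.max_failures


tracker = ListBasedTracker(max_failures=3)
for i in range(100):
    tracker.on_failure(f"evaluator-{i}", waiting_for_evaluator=True)

# Despite 100 failures, the tracker still reports that the limit is not reached.
print(tracker.limit_reached(), len(tracker.failed_ids))
```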

      I think we should remove this list (with the additional benefit of reduced memory consumption) and replace it with a single counter that is never decremented. The values in the list are only used for sanity checks.
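
      A minimal sketch of the counter-based alternative, under the same caveats: the names are hypothetical, the `>=` comparison is an assumption, and the loop stands in for a driver whose replacement evaluators keep failing.

```python
class FailureCounter:
    """Hypothetical monotonic failure counter (not REEF code)."""

    def __init__(self, max_failures):
        self.max_failures = max_failures
        self.failures = 0

    def on_failure(self):
        # Incremented on every failure, in any driver state; never decremented.
        self.failures += 1

    def limit_reached(self):
        return self.failures >= self.max_failures


counter = FailureCounter(max_failures=3)
attempts = 0
while not counter.limit_reached():
    counter.on_failure()  # simulate a replacement evaluator that fails again
    attempts += 1

# The loop terminates once the number of failures reaches the limit.
print(attempts, counter.failures)
```

      Because the counter only grows, the retry loop is bounded by the limit regardless of the state in which the failures occur.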

            People

              MariiaMykhailova Mariia Mykhailova
              MariiaMykhailova Mariia Mykhailova