Mesos / MESOS-9934

Master does not handle returning unreachable agents as draining/deactivated

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.9.0
    • Component/s: master
    • Sprint: Foundations: RI-17 Sprint 52
    • Story Points: 3

      Description

      The master has two code paths for handling agent reregistration messages: one culminates in Master::__reregisterSlave and the other in Master::_reregisterSlave. The two paths are not continuations of each other. It looks like the initial implementation missed the double-underscore case, which is the path that unreachable agents take when they come back to the cluster. As a result, when an unreachable agent is marked for draining, it is not sent the appropriate message unless it is forced to reregister again (i.e. restarted manually).
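
The failure mode described above can be illustrated with a toy model. This is only a sketch under stated assumptions and is not the Mesos source: `ToyMaster`, its fields, `maybeSendDrain`, and the `fixApplied` switch are all hypothetical names, and the assumption that the single-underscore path serves agents the master still knows about is ours. It shows how a check that lives on only one of two independent reregistration paths is skipped for returning unreachable agents, and how moving the check into a helper called from both paths would close the gap.

```cpp
#include <cassert>
#include <set>
#include <string>
#include <vector>

// Hypothetical, heavily simplified model. None of these types exist in Mesos.
struct ToyMaster {
  std::set<std::string> draining;     // agents marked for draining
  std::set<std::string> unreachable;  // agents currently considered unreachable
  std::vector<std::string> sent;      // agents that were sent a drain message
  bool fixApplied = false;            // toggles the assumed shape of the fix

  // Shared step that both reregistration paths ought to perform.
  void maybeSendDrain(const std::string& agentId) {
    if (draining.count(agentId) > 0) {
      sent.push_back(agentId);
    }
  }

  // Stand-in for the path ending in Master::_reregisterSlave
  // (assumed here to serve agents the master still knows about).
  void reregisterKnown(const std::string& agentId) {
    maybeSendDrain(agentId);
  }

  // Stand-in for the path ending in Master::__reregisterSlave,
  // taken by unreachable agents that come back to the cluster.
  void reregisterUnreachable(const std::string& agentId) {
    unreachable.erase(agentId);
    if (fixApplied) {
      maybeSendDrain(agentId);  // the step the original path was missing
    }
  }

  // Dispatch: the two paths are alternatives, not continuations.
  void reregister(const std::string& agentId) {
    if (unreachable.count(agentId) > 0) {
      reregisterUnreachable(agentId);
    } else {
      reregisterKnown(agentId);
    }
  }
};
```

In this model, with `fixApplied` left at `false`, an agent that is both unreachable and marked for draining receives nothing on its first reregistration and only gets the drain message on a second one, which mirrors the "restarted manually" workaround in the description.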

            People

            • Assignee: Joseph Wu (kaysoky)
            • Reporter: Joseph Wu (kaysoky)
            • Shepherd: Greg Mann
