VCL-503

Add timeout to hung SSH processes


Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.2.1
    • Fix Version/s: 2.4
    • Component/s: vcld (backend)
    • Labels: None

    Description

      SSH processes issued from the management node to the computer being loaded occasionally hang for a very long time or indefinitely. When this happens, the reservation process hangs as well.

      The problem usually occurs shortly after a reloaded computer first begins to respond to SSH. vcld detects that the computer is responding and begins to issue commands, but the SSH service/daemon on the computer is probably still initializing. The SSH command hangs instead of failing because it makes an initial connection, a hiccup occurs, and the SSH service on the computer then runs normally. Setting SSH options such as ServerAlive* or TCPKeepAlive doesn't help because the computer still responds to these messages.

      Code should be added to time out the SSH command process after a configurable amount of time.
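
      The requested behavior could look roughly like the sketch below. It is written in Python purely for illustration and is not the vcld code or the fix that was committed; the function name run_with_timeout, its return convention, and the timeout value in the usage note are all assumptions.

```python
import subprocess
import sys


def run_with_timeout(command, timeout_seconds):
    """Run a command and give up if it does not finish in time.

    Hypothetical helper: returns (exit_status, output) on completion,
    or None if the process had to be killed because it timed out.
    """
    try:
        completed = subprocess.run(
            command,
            stdout=subprocess.PIPE,
            stderr=subprocess.STDOUT,  # merge stderr into the captured output
            text=True,
            timeout=timeout_seconds,   # configurable limit, in seconds
        )
    except subprocess.TimeoutExpired:
        # subprocess.run has already killed and reaped the child here,
        # so the caller is free to retry or to fail the reservation.
        return None
    return completed.returncode, completed.stdout
```

      A caller might invoke it as run_with_timeout(["ssh", host, remote_command], 60), where host and remote_command are placeholders, and treat a None result as a hung SSH process. The important property is that the limit is enforced by the management node itself rather than by SSH keepalive messages, which the description notes the computer keeps answering.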


          People

            Assignee: arkurth Andrew Kurth
            Reporter: arkurth Andrew Kurth
