Details

    • Type: Bug Bug
    • Status: Closed
    • Priority: Major Major
    • Resolution: Fixed
    • Fix Version/s: Initial Clearing
    • Component/s: VMWare
    • Labels:
      None

      Description

      The problem that occurred recently seems to be back again:

      {code}
      sudo: pam_authenticate: Authentication service cannot retrieve authentication info
      {code}

      Continuum appears to be down as a result. Are you able to apply the same fix from previously? Is there anything we can do to help avoid it again?

      I know a dist-upgrade is required once it stabilises - will that help?

        Issue Links

          Activity

          Brett Porter created issue -
          Gavin made changes -
          Field Original Value New Value
          Assignee Gavin [ ipv6guru ]
          Hide
          Gavin added a comment -
          Done.
          Show
          Gavin added a comment - Done.
          Gavin made changes -
          Status Waiting for Infra [ 10011 ] Closed [ 6 ]
          Assignee Gavin [ ipv6guru ]
          Resolution Fixed [ 1 ]
          Hide
          Brett Porter added a comment -
          Thanks Gavin. Is there anything I can do to fix this in future, or changes I can make to prevent it? Or has your fix eliminated the problem?
          Show
          Brett Porter added a comment - Thanks Gavin. Is there anything I can do to fix this in future, or changes I can make to prevent it? Or has your fix eliminated the problem?
          Hide
          Gavin added a comment -
          Hi Brett,

          Nothing you could have done, other than re-boot the VM.

          Its host, Erebus occasionally has issues and sends one or more of its VMs into r/o mode (damage control?).

          In any case, Erebus is the issue and needs looking at.
          Show
          Gavin added a comment - Hi Brett, Nothing you could have done, other than re-boot the VM. Its host, Erebus occasionally has issues and sends one or more of its VMs into r/o mode (damage control?). In any case, Erebus is the issue and needs looking at.
          Hide
          Brett Porter added a comment -
          Same problem again today... Continuum is down as mysql needs restarting and I can't sudo
          Show
          Brett Porter added a comment - Same problem again today... Continuum is down as mysql needs restarting and I can't sudo
          Brett Porter made changes -
          Resolution Fixed [ 1 ]
          Status Closed [ 6 ] Reopened [ 4 ]
          Hide
          Gavin added a comment -
          same fix
          Show
          Gavin added a comment - same fix
          Gavin made changes -
          Status Reopened [ 4 ] Closed [ 6 ]
          Resolution Fixed [ 1 ]
          Hide
          Brett Porter added a comment -
          and again...
          Show
          Brett Porter added a comment - and again...
          Brett Porter made changes -
          Resolution Fixed [ 1 ]
          Status Closed [ 6 ] Reopened [ 4 ]
          Hide
          Gavin added a comment -
          and fixed again via reboot and fsck
          Show
          Gavin added a comment - and fixed again via reboot and fsck
          Hide
          Gavin added a comment -
          leaving open to see if a more permanent fix can be arranged.
          Show
          Gavin added a comment - leaving open to see if a more permanent fix can be arranged.
          Gavin made changes -
          Status Reopened [ 4 ] Waiting for Infra [ 10011 ]
          Hide
          Brett Porter added a comment -
          Thanks Gav. It'd also be great if there was a way we could do a nagios check that detected it - since the current HTTP one passes (Continuum stays up), but the app is non-functional (since we lose MySQL when this happens). Do you have any suggestions?
          Show
          Brett Porter added a comment - Thanks Gav. It'd also be great if there was a way we could do a nagios check that detected it - since the current HTTP one passes (Continuum stays up), but the app is non-functional (since we lose MySQL when this happens). Do you have any suggestions?
          Brett Porter made changes -
          Link This issue blocks INFRA-6532 [ INFRA-6532 ]
          Hide
          Brett Porter added a comment -
          After discussion with Gav, we'll build a new VM more aligned with current practices. I'll move the Continuum install over, and we'll set it up behind an SSL reverse proxy as https://continuum-ci.apache.org/

          Show
          Brett Porter added a comment - After discussion with Gav, we'll build a new VM more aligned with current practices. I'll move the Continuum install over, and we'll set it up behind an SSL reverse proxy as https://continuum-ci.apache.org/
          Gavin made changes -
          Assignee Gavin [ ipv6guru ]
          Gavin made changes -
          Component/s VMWare [ 12311930 ]
          Hide
          Gavin added a comment -
          creating the vm now.
          Show
          Gavin added a comment - creating the vm now.
          Gavin made changes -
          Status Waiting for Infra [ 10011 ] In Progress [ 3 ]
          Hide
          Gavin added a comment -
          VM done, puppet managed. Still the proxy to sort but that can be done once bretts set up the application.
          Show
          Gavin added a comment - VM done, puppet managed. Still the proxy to sort but that can be done once bretts set up the application.
          Gavin made changes -
          Status In Progress [ 3 ] Closed [ 6 ]
          Assignee Gavin [ ipv6guru ]
          Resolution Fixed [ 1 ]
          Gavin made changes -
          Fix Version/s Initial Clearing [ 12325964 ]

            People

            • Assignee:
              Unassigned
              Reporter:
              Brett Porter
            • Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                Development