Uploaded image for project: 'Mesos'
  1. Mesos
  2. MESOS-2904

Add slave metric to count container launch failures

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 0.23.0
    • agent, statistics
    • None
    • Twitter Mesos Q2 Sprint 6
    • 1

    Description

      We have seen circumstances where a machine has been consistently unable to launch containers due to an inconsistent state (for example, unexpected network configuration). Adding a metric to track container launch failures will allow us to detect and alert on slaves in such a state.

      Attachments

        Activity

          People

            pbrett Paul Brett
            pbrett Paul Brett
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: