Mesos / MESOS-940

Slave should checkpoint bootid after recovery instead of after registration

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.16.0
    • Fix Version/s: 0.16.0
    • Component/s: None
    • Labels: None

      Description

      Checkpointing the boot ID only after registration means that the slave can keep failing to recover in certain scenarios.

      Example scenario:

      --> A pre-0.16.0 slave was upgraded to 0.16.0.
      --> After a slave roll, it re-registered with the master and hence never wrote the boot ID.
      --> Now, if the machine reboots and the slave info is incompatible, the slave fails immediately during recovery.
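
      The scenario above can be modelled with a minimal Python sketch. It is not Mesos code (Mesos is written in C++), and every name in it (`recover`, `register`, `reregister`, `scenario`, the `checkpoint` dict standing in for the on-disk state) is hypothetical. It also assumes that a mismatch between the checkpointed and the current boot ID lets recovery discard stale state, while a missing boot ID forces the slave info comparison; the issue implies this but does not spell it out.

```python
class RecoveryError(Exception):
    """Recovery aborted because checkpointed slave info is incompatible."""


def recover(checkpoint, boot_id, info, checkpoint_after_recovery):
    """Model one slave start-up; `checkpoint` is a dict standing in for disk."""
    saved_boot_id = checkpoint.get("boot_id")
    if saved_boot_id is not None and saved_boot_id != boot_id:
        # Assumption: a detected reboot means the old state is stale,
        # so it is discarded instead of being checked for compatibility.
        checkpoint.clear()
    elif "info" in checkpoint and checkpoint["info"] != info:
        # No reboot detected, so incompatible info is treated as fatal.
        raise RecoveryError("incompatible slave info")
    checkpoint["info"] = info
    if checkpoint_after_recovery:
        # Proposed behaviour: write the boot ID as soon as recovery is done.
        checkpoint["boot_id"] = boot_id


def register(checkpoint, boot_id, checkpoint_after_recovery):
    """First-time registration; the old behaviour wrote the boot ID only here."""
    if not checkpoint_after_recovery:
        checkpoint["boot_id"] = boot_id


def reregister(checkpoint):
    """Re-registration writes no boot ID in either variant of this model."""


def scenario(checkpoint_after_recovery):
    # State left behind by a pre-0.16.0 slave: slave info but no boot ID.
    checkpoint = {"info": "info-v1"}
    # The upgraded slave restarts on the same boot and re-registers.
    recover(checkpoint, "boot-1", "info-v1", checkpoint_after_recovery)
    reregister(checkpoint)
    # The machine reboots and the slave comes back with different info.
    try:
        recover(checkpoint, "boot-2", "info-v2", checkpoint_after_recovery)
        return "recovered"
    except RecoveryError:
        return "failed"


if __name__ == "__main__":
    print("boot ID written after registration:", scenario(False))
    print("boot ID written after recovery:", scenario(True))
```

      In this model, `scenario(False)` returns "failed" because the reboot cannot be detected without a checkpointed boot ID, and `scenario(True)` returns "recovered" because the first successful recovery already wrote one.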

            People

            • Assignee: Vinod Kone (vinodkone)
            • Reporter: Vinod Kone (vinodkone)
            • Votes: 0
            • Watchers: 1

              Dates

              • Created:
                Updated:
                Resolved: