Uploaded image for project: 'Mesos'
  1. Mesos
  2. MESOS-4824

"filesystem/linux" isolator does not unmount orphaned persistent volumes

    XMLWordPrintableJSON

Details

    • Mesosphere Sprint 30
    • 2

    Description

      A persistent volume can be orphaned when:

      1. A framework registers with checkpointing enabled.
      2. The framework starts a task + a persistent volume.
      3. The agent exits. The task continues to run.
      4. Something wipes the agent's meta directory. This removes the checkpointed framework info from the agent.
      5. The agent comes back and recovers. The framework for the task is not found, so the task is considered orphaned now.

      The agent currently does not unmount the persistent volume, saying (with GLOG_v=1)

      I0229 23:55:42.078940  5635 linux.cpp:711] Ignoring cleanup request for unknown container: a35189d3-85d5-4d02-b568-67f675b6dc97
      

      Test implemented here: https://reviews.apache.org/r/44122/

      Attachments

        Activity

          People

            kaysoky Joseph Wu
            kaysoky Joseph Wu
            Jie Yu Jie Yu
            Votes:
            1 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: