Uploaded image for project: 'Mesos'
  1. Mesos
  2. MESOS-10232

Old sandboxes not being GC'ed caused frequent Mesos GC

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • None
    • 1.12.0
    • agent
    • None

    Description

      Customers reported that their logs (sandbox files) are missing soon after the job completes. Mesos agent logs indicate that the files were GC-ed within minutes of container exit. Checking the host, there were a lot of old sandboxes dating back to Jan 2020. These are occupying a lot of space (~88% of all sandbox usage) and likely causing frequent GC of recently running containers. 

      Mesos does recognize these sandbox and try to schedule them for deletion

       I0902 18:02:27.511576 467334 gc.cpp:95] Scheduling '/var/lib/mesos/meta/slaves/68caec4c-6ea5-44e7-9f8-fad1922d5-S162/frameworks/3dcc744f-016c-6579-9b82-6325402d2-9999/executors/fa00-29a3-4c47-95fd-808d52ac53-13-1' for gc -85.5641509780737weeks in the future
      

      but the deletion seems to never happen.

      Attachments

        Activity

          People

            Unassigned Unassigned
            xiao66xiang HAO SU
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: