Details
-
Bug
-
Status: Resolved
-
Minor
-
Resolution: Fixed
-
None
-
None
Description
Customers reported that their logs (sandbox files) are missing soon after the job completes. Mesos agent logs indicate that the files were GC-ed within minutes of container exit. Checking the host, there were a lot of old sandboxes dating back to Jan 2020. These are occupying a lot of space (~88% of all sandbox usage) and likely causing frequent GC of recently running containers.
Mesos does recognize these sandbox and try to schedule them for deletion
I0902 18:02:27.511576 467334 gc.cpp:95] Scheduling '/var/lib/mesos/meta/slaves/68caec4c-6ea5-44e7-9f8-fad1922d5-S162/frameworks/3dcc744f-016c-6579-9b82-6325402d2-9999/executors/fa00-29a3-4c47-95fd-808d52ac53-13-1' for gc -85.5641509780737weeks in the future
but the deletion seems to never happen.