Details
-
Bug
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
-
None
Description
When the master sends a CheckpointResourcesMessage to the agent, the agent attempts to rmdir the persistent volume for a DESTROY operation (if it existed before, and is no longer in the updated checkpoint in CheckpointResourcesMessage).
If the slave restarts before the operation finishes, the disk space can be leaked because a reattempt of a rmdir is not done (since the checkpoint is already updated).
Subsequently, a CREATE on the same path could result in leaking of the data to another framework (since the directory was not rm-ed) since the CREATE operation is successful even if the root directory exists and the contents of that directory is not empty.