Details
-
Bug
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
Impala 4.0.0, Impala 4.1.0, Impala 4.2.0, Impala 4.3.0
-
None
-
ghx-label-14
Description
The bug occurs during remote spilling when writing spilled data to local buffers. In this process, if the files are not completely filled, such as when no more data is incoming at the end of spilling, there is a possibility of partial writes to the files, the files might be physically removed without properly releasing the associated file descriptor. This issue can be observed in cases like the one described below.
find /proc/*/fd -ls | grep '(deleted)'
288574785 0 lrwx------ 1 impala impala 64 Jan 3 14:24 /proc/x/fd/xxxx -> /opt/impala/scratch/impala-scratch/impala-scratch-xxxxxxxxx-xxxx-xxxx\ (deleted)
In such a scenario, the disk space occupied by the file may not be reclaimed because the file descriptor still maintains a reference to the file.