Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-10434

Blob file not removed after job execution.

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Abandoned
    • 1.5.2
    • None
    • Runtime / Coordination
    • None
    • Flink container running on Yarn.

    Description

      We have Flink container running on Yarn, and we're using YarnClusterClient for job submission. After successfully/failed job execution it looks like blob file for that job is deleted, but there is still handle from Flink process to that file. As a result the file is
      not removed from machine, until we restart the whole Flink container.

      From the size comparison it looks like mentioned file is actually our jar with Flink job.

      This is quite big problem for us, as we are submitting many jobs, every submission upload new jar and created new blob file, which is never removed from disc until we restart container. We already faced out of disc space.

      Results of lsof are:
      During job execution:
      lsof /flinkDir | grep job_dbafb671b0d60ed8a8ec2651fe59303b
      java 11883 yarn mem REG 253,2 112384928 109973177
      /flinkDir/yarn/../application_1536668870638_5555/blobStore-a1bcdbd4-5388-4c56-8052-6051f5af38dd/job_dbafb671b0d60ed8a8ec2651fe59303b/blob_p-8771d9ccac35e28d8571ac8957feaaecdebaeadd-7748aec7fe7369ca26181d0f94b1a578
      java 11883 yarn 1837r REG 253,2 112384928 109973177
      /flinkDir/yarn/../application_1536668870638_5555/blobStore-a1bcdbd4-5388-4c56-8052-6051f5af38dd/job_dbafb671b0d60ed8a8ec2651fe59303b/blob_p-8771d9ccac35e28d8571ac8957feaaecdebaeadd-7748aec7fe7369ca26181d0f94b1a578

      After job execution:
      lsof /flinkDir | grep job_dbafb671b0d60ed8a8ec2651fe59303b
      java 11883 yarn DEL REG 253,2 109973177
      /flinkDir/yarn/../application_1536668870638_5555/blobStore-a1bcdbd4-5388-4c56-8052-6051f5af38dd/job_dbafb671b0d60ed8a8ec2651fe59303b/blob_p-8771d9ccac35e28d8571ac8957feaaecdebaeadd-7748aec7fe7369ca26181d0f94b1a578
      java 11883 yarn 1837r REG 253,2 112384928 109973177
      /flinkDir/yarn/../application_1536668870638_5555/blobStore-a1bcdbd4-5388-4c56-8052-6051f5af38dd/job_dbafb671b0d60ed8a8ec2651fe59303b/blob_p-8771d9ccac35e28d8571ac8957feaaecdebaeadd-7748aec7fe7369ca26181d0f94b1a578
      (deleted)

      After restarting Flink container this handle disappeared.

      Attachments

        Activity

          People

            Unassigned Unassigned
            piotr.szczepanek Piotr Szczepanek
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: