In our Flink (1.6.3) production environment, I often see the YARN application fail to stop after a Flink job fails in per-job YARN cluster mode, so I dug into why this happens.
When a Flink job fails, the system writes an archive file to a FileSystem through a method, and then notifies YarnJobClusterEntrypoint to shut down. However, if an exception is thrown while the archive is being written, the subsequent calls, including the shutdown notification, are affected.
So I opened FLINK-12247 to fix the NPE that occurs when the system writes the archive to the FileSystem. But we still need to consider other exceptions, so we should catch Exception / Throwable, not just IOException.
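
To illustrate the proposed handling, here is a minimal standalone sketch. It is not Flink code: `ArchiveThenShutdownSketch`, `Archiver`, and `archiveThenShutdown` are hypothetical names standing in for the archive-writing step and the shutdown notification. It only shows that catching Throwable (rather than only IOException) lets the shutdown step run even when archiving fails with an unchecked exception.

```java
import java.io.IOException;

final class ArchiveThenShutdownSketch {

    /** Hypothetical stand-in for the step that writes the archive file. */
    interface Archiver {
        void archive() throws IOException;
    }

    /**
     * Runs the archiving step, then always runs the shutdown step.
     * Returns the failure raised by archiving, or null if archiving succeeded.
     */
    static Throwable archiveThenShutdown(Archiver archiver, Runnable shutdown) {
        Throwable failure = null;
        try {
            archiver.archive();
        } catch (Throwable t) {
            // A catch for IOException alone would let an NPE escape here
            // and skip everything that follows the archive call.
            failure = t;
        } finally {
            // The shutdown notification runs whether or not archiving failed.
            shutdown.run();
        }
        return failure;
    }

    public static void main(String[] args) {
        final boolean[] shutdownCalled = {false};
        Throwable failure = archiveThenShutdown(
                () -> { throw new NullPointerException("simulated archive failure"); },
                () -> shutdownCalled[0] = true);
        System.out.println("archive failure: " + failure);
        System.out.println("shutdown called: " + shutdownCalled[0]);
    }
}
```

In this sketch the simulated NPE is captured and reported, and the shutdown callback still runs. Whether the real fix should log the failure, rethrow it after shutdown, or complete a future exceptionally is a design choice left open here.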