We face a problem when we try to use flink on k8s to execute task with udf/udtf. When we finished or canceled a job and submit a same job, the old TM will be not reachable and restart. Why the TM have to be restart? In session mode, the TM should be reused by JM. Moreover, if we off restart strategy, this task will fail.
First submit, the job will running:
But, cancel it and submit the same:
Internal server error, but in k8s the pod is running.