Flink / FLINK-20219

Rethink the HA related ZNodes/ConfigMap clean up for session cluster


Details

    Description

      While testing the Kubernetes HA service, I realized that cleaning up the ConfigMaps of a session cluster (both standalone and native) is not very easy.

      • For the native K8s session, we suggest that users stop it via echo 'stop' | ./bin/kubernetes-session.sh -Dkubernetes.cluster-id=<ClusterID> -Dexecution.attached=true. Currently, this has the same effect as kubectl delete deploy <ClusterID>. It will not clean up the leader ConfigMaps (e.g. ResourceManager, Dispatcher, RestServer, JobManager). Even if there are no running jobs before the stop, we are still left with some retained ConfigMaps. So when and how should the retained ConfigMaps be cleaned up? Should the user do it manually, or could we provide some utilities in the Flink client? (See the sketches after this list.)
      • For the standalone session, I think it is reasonable for users to clean up the HA ConfigMaps manually.
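
      Before cleaning anything up, the retained leader ConfigMaps can be listed with a plain kubectl query. This is a minimal sketch, assuming the leader ConfigMaps carry the same labels that the manual delete command below relies on:

      kubectl get cm --selector='app=<ClusterID>,configmap-type=high-availability'

      The entries returned should correspond to the leader components mentioned above (ResourceManager, Dispatcher, RestServer, JobManager).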

       

      We could use the following command to do the clean-up manually.

      kubectl delete cm --selector='app=<ClusterID>,configmap-type=high-availability'
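
      If we instead provide a clean-up utility in the Flink client, it could do the same thing through the fabric8 Kubernetes client that the native K8s integration is built on. The following is only a rough sketch: the class name and entry point are hypothetical, the namespace handling is simplified, and the label selector is the one from the kubectl command above.

      import io.fabric8.kubernetes.client.DefaultKubernetesClient;
      import io.fabric8.kubernetes.client.KubernetesClient;

      // Hypothetical helper, not an existing Flink class: deletes all retained
      // HA ConfigMaps of one session cluster, selected by label.
      public class HaConfigMapCleanup {
          public static void main(String[] args) {
              String clusterId = args[0];                               // kubernetes.cluster-id of the session
              String namespace = args.length > 1 ? args[1] : "default"; // namespace the cluster was deployed in
              try (KubernetesClient client = new DefaultKubernetesClient()) {
                  client.configMaps()
                          .inNamespace(namespace)
                          .withLabel("app", clusterId)
                          .withLabel("configmap-type", "high-availability")
                          .delete();
              }
          }
      }

      Such a utility could be hooked into the 'stop' path of kubernetes-session.sh, so that users get a one-command way to remove the retained ConfigMaps without falling back to kubectl.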

       

      Note: This is not a problem for the Flink application cluster, since we can do the clean-up automatically once all the running jobs in the application have reached a terminal state (e.g. FAILED, CANCELED, FINISHED) and then destroy the Flink cluster.


            People

              Assignee: wangyang0918 (Yang Wang)
              Reporter: wangyang0918 (Yang Wang)