I was running Flink 1.8 and decided to upgrade to 1.10. I started to get OutOfMemory: Metaspace on Task nodes and they would shutdown.
Deploying 3-4 jobs was ok, deploying 5th job, the task manager would shutdown after a couple of minutes.
Some investigation shows, that the Flink startup scripts set this to 100m.
Setting taskmanager.memory.jvm-metaspace.size: 256m seems to alleviate the problem. Looking at the docs, this setting seems new. But it is not documented anywhere that the default is now 100m
I would like to add the exact same setup/jobs on Flink 1.8 was working fine.
Here is a conversation on the mailing list as well.
I think the upgrade/migration docs should indicate this and also maybe the default size.