Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-31135

ConfigMap DataSize went > 1 MB and cluster stopped working

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Not A Bug
    • kubernetes-operator-1.2.0
    • None
    • Kubernetes Operator
    • None

    Description

      I am Flink Operator to manage clusters. Flink version: 1.15.2. Flink jobs failed with the below error. It seems the config map size went beyond 1 MB (default size). 

      Since it is managed by the operator and config maps are not updated with any manual intervention, I suspect it could be an operator issue. 

       

      Caused by: io.fabric8.kubernetes.client.KubernetesClientException: Failure executing: PUT at: https://<IP>/api/v1/namespaces/<NS>/configmaps/<job>-config-map. Message: ConfigMap "<job>-config-map" is invalid: []: Too long: must have at most 1048576 bytes. Received status: Status(apiVersion=v1, code=422, details=StatusDetails(causes=[StatusCause(field=[], message=Too long: must have at most 1048576 bytes, reason=FieldValueTooLong, additionalProperties={})], group=null, kind=ConfigMap, name=<job>-config-map, retryAfterSeconds=null, uid=null, additionalProperties={}), kind=Status, message=ConfigMap "<job>-config-map" is invalid: []: Too long: must have at most 1048576 bytes, metadata=ListMeta(_continue=null, remainingItemCount=null, resourceVersion=null, selfLink=null, additionalProperties={}), reason=Invalid, status=Failure, additionalProperties={}).
      at io.fabric8.kubernetes.client.dsl.base.OperationSupport.requestFailure(OperationSupport.java:673) ~[flink-dist-1.15.2.jar:1.15.2]
      at io.fabric8.kubernetes.client.dsl.base.OperationSupport.assertResponseCode(OperationSupport.java:612) ~[flink-dist-1.15.2.jar:1.15.2]
      at io.fabric8.kubernetes.client.dsl.base.OperationSupport.handleResponse(OperationSupport.java:560) ~[flink-dist-1.15.2.jar:1.15.2]
      at io.fabric8.kubernetes.client.dsl.base.OperationSupport.handleResponse(OperationSupport.java:521) ~[flink-dist-1.15.2.jar:1.15.2]
      at io.fabric8.kubernetes.client.dsl.base.OperationSupport.handleUpdate(OperationSupport.java:347) ~[flink-dist-1.15.2.jar:1.15.2]
      at io.fabric8.kubernetes.client.dsl.base.OperationSupport.handleUpdate(OperationSupport.java:327) ~[flink-dist-1.15.2.jar:1.15.2]
      at io.fabric8.kubernetes.client.dsl.base.BaseOperation.handleUpdate(BaseOperation.java:781) ~[flink-dist-1.15.2.jar:1.15.2]
      at io.fabric8.kubernetes.client.dsl.base.HasMetadataOperation.lambda$replace$1(HasMetadataOperation.java:183) ~[flink-dist-1.15.2.jar:1.15.2]
      at io.fabric8.kubernetes.client.dsl.base.HasMetadataOperation.replace(HasMetadataOperation.java:188) ~[flink-dist-1.15.2.jar:1.15.2]
      at io.fabric8.kubernetes.client.dsl.base.HasMetadataOperation.replace(HasMetadataOperation.java:130) ~[flink-dist-1.15.2.jar:1.15.2]
      at io.fabric8.kubernetes.client.dsl.base.HasMetadataOperation.replace(HasMetadataOperation.java:41) ~[flink-dist-1.15.2.jar:1.15.2]
      at org.apache.flink.kubernetes.kubeclient.Fabric8FlinkKubeClient.lambda$attemptCheckAndUpdateConfigMap$11(Fabric8FlinkKubeClient.java:325) ~[flink-dist-1.15.2.jar:1.15.2]
      at java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1700) ~[?:?]
      ... 3 more 

      Attachments

        1. flink--kubernetes-application-0-parked-logs-ingestion-644b80-b4bc58747-lc865.log.zip
          336 kB
          Zhihao Chen
        2. image-2023-05-03-13-51-21-685.png
          979 kB
          Zhihao Chen
        3. image-2023-05-03-13-50-54-783.png
          979 kB
          Zhihao Chen
        4. image-2023-05-03-13-47-51-440.png
          817 kB
          Zhihao Chen
        5. parked-logs-ingestion-644b80-3494e4c01b82eb7a75a76080974b41cd-config-map.yaml
          1.04 MB
          Zhihao Chen
        6. jobmanager_log.txt
          2.93 MB
          Zhihao Chen
        7. image-2023-04-19-09-48-19-089.png
          658 kB
          Zhihao Chen
        8. dump_cm.yaml
          1.04 MB
          Zhihao Chen

        Activity

          People

            Unassigned Unassigned
            sriramgr Sriram Ganesh
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: