Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-10866

Queryable state can prevent cluster from starting

    XMLWordPrintableJSON

Details

    Description

      The KvStateServerImpl can currently prevent the TaskExecutor from starting.

      Currently, the QS server starts per default on port 9067. If this port is not free, then it fails and stops the whole initialization of the TaskExecutor. I think the QS server should not stop the TaskExecutor from starting.

      We should at least change the default port to 0 to avoid port conflicts. However, this will break all setups which don't explicitly set the QS port because now it either needs to be setup or extracted from the logs.

      Additionally, we should think about whether a QS server startup failure should lead to a TaskExecutor failure or simply be logged. Both approaches have pros and cons. Currently, a failing QS server will also affect users which don't want to use QS. If we tolerate failures in the QS server, then a user who wants to use QS might run into problems with state not being reachable.

      Attachments

        Issue Links

          Activity

            People

              eaglewatcher BoWang
              trohrmann Till Rohrmann
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 0.5h
                  0.5h