Flink / FLINK-19806

Job may try to leave SUSPENDED state in ExecutionGraph#failJob()


Details

    Description

      SUSPENDED is a terminal state: once a job enters it, the job is not supposed to leave it. However, ExecutionGraph#failJob() does not check for this and may try to transition a job out of the SUSPENDED state. This causes unexpected errors and may crash the JobManager (JM).
      The problem becomes visible if we rework ExecutionGraphSuspendTest to be based on DefaultScheduler.
      We should harden the check in ExecutionGraph#failJob().
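
The kind of guard meant here can be sketched with a small stand-alone model. This is not Flink's actual code: SketchJobStatus, SketchExecutionGraph, and their methods are hypothetical names invented for illustration, and the real ExecutionGraph carries far more state. The sketch only assumes that failJob() should return early when the job is already failing or already in a terminal state such as SUSPENDED.

```java
// Hypothetical, simplified stand-in for a job status enum (not Flink's JobStatus).
enum SketchJobStatus {
    RUNNING(false),
    FAILING(false),
    FAILED(true),
    CANCELED(true),
    FINISHED(true),
    SUSPENDED(true);

    private final boolean terminal;

    SketchJobStatus(boolean terminal) {
        this.terminal = terminal;
    }

    // True for states that a job must never leave once it has entered them.
    boolean isTerminal() {
        return terminal;
    }
}

// Hypothetical, simplified stand-in for an execution graph (not Flink's ExecutionGraph).
final class SketchExecutionGraph {
    private SketchJobStatus state = SketchJobStatus.RUNNING;
    private Throwable failureCause;

    SketchJobStatus getState() {
        return state;
    }

    Throwable getFailureCause() {
        return failureCause;
    }

    // Moves the job to SUSPENDED unless it is already in a terminal state.
    void suspend() {
        if (!state.isTerminal()) {
            state = SketchJobStatus.SUSPENDED;
        }
    }

    // Hardened failJob(): ignores the request if the job is already failing
    // or has reached a terminal state. Returns true only if the transition
    // to FAILING actually happened.
    boolean failJob(Throwable cause) {
        if (state == SketchJobStatus.FAILING || state.isTerminal()) {
            return false;
        }
        failureCause = cause;
        state = SketchJobStatus.FAILING;
        return true;
    }
}
```

With this guard, calling failJob() on a suspended graph is a no-op: the state stays SUSPENDED and no failure cause is recorded. A running graph still moves to FAILING as before.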


            People

              zhuzh Zhu Zhu
              zhuzh Zhu Zhu
