Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-7231

SlotSharingGroups are not always released in time for new restarts

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Blocker
    • Resolution: Fixed
    • 1.3.1
    • 1.3.2, 1.4.0
    • Runtime / Coordination
    • None

    Description

      In the case where there are not enough resources to schedule the streaming program, a race condition can lead to a sequence of the following errors:

      java.lang.IllegalStateException: SlotSharingGroup cannot clear task assignment, group still has allocated resources.
      

      This eventually recovers, but may involve many fast restart attempts before doing so.

      The root cause is that slots are not cleared before the next restart attempt.

      Attachments

        Issue Links

          Activity

            People

              sewen Stephan Ewen
              sewen Stephan Ewen
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: