FLINK-9324

SingleLogicalSlot returns completed release future before slot is properly returned

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Blocker
    • Resolution: Fixed
    • Affects Version/s: 1.5.0, 1.6.0
    • Fix Version/s: 1.5.0
    • Component/s: Runtime / Coordination

      Description

      The SingleLogicalSlot#releaseSlot method returns a future that is supposed to complete once the slot has been returned to the SlotOwner. However, we do not wait for the SlotOwner's response before completing the future; we complete it directly after the call has been made. As a result, during recovery the ExecutionGraph can be restarted before all of its slots have been returned to the SlotPool. Allocating the new tasks might then require more slots than the max parallelism, because the new tasks collide with old tasks that have not yet been released (in case of slot sharing).
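
The difference between the two behaviours can be sketched with plain CompletableFuture code. This is a minimal illustration, not the actual Flink implementation or the committed fix: the SlotOwner interface below is a hypothetical stand-in that models only an asynchronous return call, and the class and method names ReleaseFutureSketch, releaseEagerly and releaseAfterResponse are invented for the example.

```java
import java.util.concurrent.CompletableFuture;

public class ReleaseFutureSketch {

    /** Hypothetical stand-in for the slot owner; only the asynchronous return call is modeled. */
    public interface SlotOwner {
        CompletableFuture<Boolean> returnAllocatedSlot(String slotId);
    }

    /** Problematic pattern: the call is made, but the returned future is already complete. */
    public static CompletableFuture<Void> releaseEagerly(SlotOwner owner, String slotId) {
        owner.returnAllocatedSlot(slotId); // the owner's response is ignored
        return CompletableFuture.completedFuture(null);
    }

    /** Intended pattern: the release future completes only after the owner has responded. */
    public static CompletableFuture<Void> releaseAfterResponse(SlotOwner owner, String slotId) {
        return owner.returnAllocatedSlot(slotId).thenAccept(ignored -> { });
    }

    public static void main(String[] args) {
        // The owner's response stays pending until we complete it by hand.
        CompletableFuture<Boolean> ownerResponse = new CompletableFuture<>();
        SlotOwner owner = slotId -> ownerResponse;

        CompletableFuture<Void> eager = releaseEagerly(owner, "slot-1");
        CompletableFuture<Void> chained = releaseAfterResponse(owner, "slot-1");

        System.out.println("eager done before response:   " + eager.isDone());
        System.out.println("chained done before response: " + chained.isDone());

        ownerResponse.complete(true); // the owner acknowledges the returned slot
        System.out.println("chained done after response:  " + chained.isDone());
    }
}
```

In this sketch, a caller that restarts work as soon as the release future is done would proceed too early with releaseEagerly, whereas releaseAfterResponse holds it back until the owner has acknowledged the slot.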

              People

              • Assignee: Till Rohrmann (trohrmann)
              • Reporter: Till Rohrmann (trohrmann)
              • Votes: 0
              • Watchers: 2