Flink / FLINK-15687

Potential test instabilities due to concurrent access to TaskSlotTable.

    Details

      Description

      Working on FLINK-14742 revealed that the cause of that test instability was that the taskSlotTable of the TaskManager under test was modified from multiple threads, namely the test thread and the main thread of the RpcEndpoint. This data structure is not thread-safe, so this should not happen.

      This anti-pattern seems to be repeated in multiple tests, for example in most of the tests in TaskExecutorSubmissionTest (look for calls to TaskSlotTable.allocateSlot()). There, the test thread calls taskSlotTable.allocateSlot() directly and then calls tmGateway.submitTask(), which accesses the slot table from within the main thread of the RPC endpoint.
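      The following is a minimal sketch of the thread-confinement idea behind this concern, not Flink code. FakeSlotTable is a hypothetical stand-in for a non-thread-safe slot table, and a single-threaded executor is assumed to play the role of the RPC endpoint's main thread. The stand-in rejects any access from another thread, and the "test" hands its allocation to the main thread instead of mutating the table directly.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

final class SlotTableConfinementSketch {

    /** Hypothetical stand-in for a non-thread-safe slot table (not Flink's TaskSlotTable). */
    static final class FakeSlotTable {
        private final Map<Integer, String> slots = new HashMap<>();
        // The thread that created the table is treated as its only legal accessor.
        private final Thread owner = Thread.currentThread();

        boolean allocateSlot(int index, String jobId) {
            assertOwnerThread();
            return slots.putIfAbsent(index, jobId) == null;
        }

        boolean isAllocated(int index) {
            assertOwnerThread();
            return slots.containsKey(index);
        }

        private void assertOwnerThread() {
            if (Thread.currentThread() != owner) {
                throw new IllegalStateException(
                        "slot table accessed from " + Thread.currentThread().getName());
            }
        }
    }

    /** Allocation and the later lookup both run on the stand-in main thread. */
    static boolean allocateThenReadOnMainThread() throws Exception {
        ExecutorService mainThread = Executors.newSingleThreadExecutor();
        try {
            Callable<FakeSlotTable> create = FakeSlotTable::new;
            FakeSlotTable table = mainThread.submit(create).get();

            // Test setup: schedule the mutation on the main thread and wait for it.
            Callable<Boolean> allocate = () -> table.allocateSlot(0, "job-1");
            boolean allocated = mainThread.submit(allocate).get();

            // Stand-in for the work that submitTask() would do on the main thread.
            Callable<Boolean> lookup = () -> table.isAllocated(0);
            boolean visible = mainThread.submit(lookup).get();

            return allocated && visible;
        } finally {
            mainThread.shutdownNow();
        }
    }

    /** The anti-pattern: the test thread touches the table directly and is rejected. */
    static boolean directAccessIsRejected() throws Exception {
        ExecutorService mainThread = Executors.newSingleThreadExecutor();
        try {
            Callable<FakeSlotTable> create = FakeSlotTable::new;
            FakeSlotTable table = mainThread.submit(create).get();
            try {
                table.allocateSlot(0, "job-1"); // called from the test thread
                return false;
            } catch (IllegalStateException expected) {
                return true;
            }
        } finally {
            mainThread.shutdownNow();
        }
    }

    public static void main(String[] args) throws Exception {
        System.out.println(allocateThenReadOnMainThread());
        System.out.println(directAccessIsRejected());
    }
}
```

      Whether the affected tests should route the allocation through the endpoint's main thread or set up the slot table before the endpoint starts is left open here; the sketch only illustrates why mixing the two threads is a problem.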

      This JIRA is just to investigate whether this is also a problem in those tests.

      cc Till Rohrmann, Chesnay Schepler, Yang Wang

            People

            • Assignee: Unassigned
            • Reporter: Kostas Kloudas (kkl0u)