During testing we saw a dead lock in the scheduler that stopped all scheduling.
The scheduler is doing two thing at the time:
- normal scheduling routine
- restoring reservations from a node
The scheduler is compiled by default with race detection and during the same test runs we also saw a data race logged.
Stack and data race log info are attached (cleaned up versions)