Details
-
Bug
-
Status: Closed
-
Major
-
Resolution: Fixed
-
1.0.0-Ducc
-
None
Description
When a node leaves because of ping timeout and then returns, the total share count isn't being adjusted correctly on return. This causes more subtractions than additions and the share count falls negative, rendering the cluster unschedulable.
Fix is in NodePool.nodearrive(), be sure to alwasys increment shares in all cases of a node returning.