Uploaded image for project: 'UIMA'
  1. UIMA
  2. UIMA-3939

DUCC RM: share count goes negative

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 1.0.0-Ducc
    • 1.1.0-Ducc
    • DUCC
    • None

    Description

      When a node leaves because of ping timeout and then returns, the total share count isn't being adjusted correctly on return. This causes more subtractions than additions and the share count falls negative, rendering the cluster unschedulable.

      Fix is in NodePool.nodearrive(), be sure to alwasys increment shares in all cases of a node returning.

      Attachments

        Activity

          People

            challngr Jim Challenger
            challngr Jim Challenger
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: