UIMA / UIMA-5057

DUCC Orchestrator (OR) handle down JD node

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 2.2.0-Ducc
    • Component/s: DUCC
    • Labels: None

      Description

      If a node that goes down happens to be a JD (Job Driver) node, then:
      1. cancel any running jobs whose JD was assigned there, and
      2. allocate a new JD node, if needed and possible.

      Currently, when the Agent hosting the JD of a running Job is killed, the Job hangs because the Agent is no longer there to carry out the Orchestrator-directed clean-up procedures.
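
The two steps above can be sketched in Java as follows. This is only an illustration under stated assumptions, not the DUCC implementation: the class `DownJdNodeSketch`, its methods, the map from job id to JD node name, and the rule that a replacement is "needed" when no allocated JD node is still up are all hypothetical.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;

public class DownJdNodeSketch {

    // Step 1: ids of running jobs whose JD was assigned to a node that is now down.
    // jdNodeByJob maps job id -> name of the node hosting that job's JD (hypothetical shape).
    public static List<String> jobsToCancel(Map<String, String> jdNodeByJob,
                                            Set<String> downNodes) {
        List<String> result = new ArrayList<>();
        // Copy into a TreeMap so the result is ordered by job id.
        for (Map.Entry<String, String> e : new TreeMap<>(jdNodeByJob).entrySet()) {
            if (downNodes.contains(e.getValue())) {
                result.add(e.getKey());
            }
        }
        return result;
    }

    // Step 2 (assumption): a replacement JD node is "needed" when none of the
    // allocated JD nodes is still up. The issue does not define "needed".
    public static boolean replacementNeeded(Set<String> allocatedJdNodes,
                                            Set<String> downNodes) {
        for (String node : allocatedJdNodes) {
            if (!downNodes.contains(node)) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        Map<String, String> jdNodeByJob =
                Map.of("job-1", "nodeA", "job-2", "nodeB", "job-3", "nodeA");
        Set<String> down = Set.of("nodeA");
        System.out.println(jobsToCancel(jdNodeByJob, down));                    // [job-1, job-3]
        System.out.println(replacementNeeded(Set.of("nodeA", "nodeB"), down)); // false
    }
}
```

Whether a replacement is "possible" depends on what the Resource Manager can allocate, which this sketch does not model.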

        Activity

        Lou DeGenaro (lou.degenaro) added a comment:

        Whenever the Orchestrator does a publication, it calls org.apache.uima.ducc.orchestrator.jd.scheduler.JdScheduler.handle(IDuccWorkMap dwm). That method considers expanding or contracting the JD share pool, but it fails to consider that an already allocated JD node may have gone down. We add a new call to monitor() to cover this possibility.

        Note that we rely on the database, as updated by the Resource Manager, to determine whether a node is down.
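
A minimal Java sketch of the flow described in this comment is shown below. It is not the actual `JdScheduler` code: the class `JdSchedulerSketch`, its fields, the `resizePool()` step, and the event log are hypothetical, and the `Predicate<String>` merely stands in for a lookup in the database that the Resource Manager updates.

```java
import java.util.ArrayList;
import java.util.Collection;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;
import java.util.function.Predicate;

public class JdSchedulerSketch {

    private final Set<String> allocatedJdNodes = new LinkedHashSet<>();
    // Stand-in for "is this node down?" as recorded in the RM-updated database.
    private final Predicate<String> isNodeDown;
    private final List<String> events = new ArrayList<>();

    public JdSchedulerSketch(Collection<String> initialJdNodes, Predicate<String> isNodeDown) {
        this.allocatedJdNodes.addAll(initialJdNodes);
        this.isNodeDown = isNodeDown;
    }

    // Invoked on every publication (hypothetical simplification of handle(IDuccWorkMap)).
    public void handle() {
        monitor();     // added step: notice allocated JD nodes that have gone down
        resizePool();  // pre-existing concern: expand or contract the JD share pool
    }

    // Drop every allocated JD node that is reported down and record the event.
    private void monitor() {
        allocatedJdNodes.removeIf(node -> {
            boolean down = isNodeDown.test(node);
            if (down) {
                events.add("down:" + node);
            }
            return down;
        });
    }

    // Placeholder: only records the pool size that a real resize would start from.
    private void resizePool() {
        events.add("resize:" + allocatedJdNodes.size());
    }

    public List<String> allocated() { return new ArrayList<>(allocatedJdNodes); }

    public List<String> events() { return new ArrayList<>(events); }
}
```

In this sketch `monitor()` runs before the pool is resized, so the resize step already sees the pool without the down node. The comment does not state where in `handle()` the real call was placed, so that ordering is an assumption.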


          People

          • Assignee: Lou DeGenaro (lou.degenaro)
          • Reporter: Lou DeGenaro (lou.degenaro)