  1. Airavata
  2. AIRAVATA-2956

Possible race condition in job monitoring

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: helix implementation
    • Labels: None

    Description

      When the job submission task submits a job to a compute resource, it receives a job ID, which is then saved under a ZooKeeper path for use by the post workflow. In some cases, however, the job completes before that metadata has been saved in ZooKeeper, and the post workflow then fails because it cannot find a monitoring register for the job:

      2018-11-21 18:15:55,783 [main] INFO  o.a.a.h.i.w.PostWorkflowManager  - Processing job result of job id 9839 sent by EmailBasedProducer
      2018-11-21 18:15:55,785 [main] WARN  o.a.a.h.i.w.PostWorkflowManager  - Could not find a monitoring register for job id 9839
      2018-11-21 18:15:55,785 [main] INFO  o.a.a.h.i.w.PostWorkflowManager  - Status of processing 9839 : false
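
      One way to tolerate this ordering problem is for the post workflow to retry the register lookup for a short, bounded time instead of failing on the first miss. The sketch below illustrates that idea only; it is not the fix that was applied in Airavata. `MonitoringRegister`, `PostWorkflowSketch`, and `findWithRetry` are hypothetical names, and an in-memory map stands in for the ZooKeeper path.

```java
import java.util.Map;
import java.util.Optional;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical stand-in for the ZooKeeper-backed monitoring register (not Airavata's API).
class MonitoringRegister {
    private final Map<String, String> jobIdToTaskId = new ConcurrentHashMap<>();

    // Called by the submitting side once the compute resource has returned a job ID.
    void save(String jobId, String taskId) {
        jobIdToTaskId.put(jobId, taskId);
    }

    // Called by the post workflow side when a job result arrives.
    Optional<String> find(String jobId) {
        return Optional.ofNullable(jobIdToTaskId.get(jobId));
    }
}

class PostWorkflowSketch {
    // Polls the register up to maxAttempts times, sleeping delayMillis between misses,
    // so a job result that arrives before the metadata is saved is not dropped at once.
    static Optional<String> findWithRetry(MonitoringRegister register, String jobId,
                                          int maxAttempts, long delayMillis) {
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            Optional<String> taskId = register.find(jobId);
            if (taskId.isPresent()) {
                return taskId;
            }
            if (attempt < maxAttempts) {
                try {
                    Thread.sleep(delayMillis);
                } catch (InterruptedException e) {
                    // Preserve the interrupt flag and give up early.
                    Thread.currentThread().interrupt();
                    return Optional.empty();
                }
            }
        }
        return Optional.empty();
    }

    public static void main(String[] args) throws InterruptedException {
        MonitoringRegister register = new MonitoringRegister();

        // Simulate the race: the metadata is saved shortly after the job result arrives.
        Thread submitter = new Thread(() -> {
            try {
                Thread.sleep(50);
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
            register.save("9839", "task-1");
        });
        submitter.start();

        Optional<String> taskId = findWithRetry(register, "9839", 100, 10);
        submitter.join();
        System.out.println("Monitoring register found: " + taskId.isPresent());
    }
}
```

      Retrying only narrows the window. An alternative under the same assumptions is to make the submitting side responsible for ordering, for example by not treating the submission as complete until the register entry has been written.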

          People

            Assignee: dimuthuupe Dimuthu
            Reporter: dimuthuupe Dimuthu