Uploaded image for project: 'Edgent'
  1. Edgent
  2. EDGENT-396

JobMonitorApp restarts job 3 times more than it should

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • Apache Edgent 1.2.0
    • Runtime
    • None

    Description

      N.B. I don't think this bug affects IotProvider since I'm pretty sure that while IotProvider includes JobMonitorApp, it doesn't register a JobRegistryService so the monitor does nothing. JIRA forthcoming for that.

      JobMonitorAppTest exercises the app but it doesn't perform any validation that restarts were actually happening. Adding instrumentation / validation highlights that 3x the number of rebuilds/restarts are happening.

      appOne: buildCnt: 7 injectedFailureCnt: 2
      appTwo: buildCnt: 10 injectedFailureCnt: 3
      

      Further investigation identifies the JobMonitorApp's job event filtering as the problem. Each "failed" job ends up with 3 events that pass through the filter

      RUNNING, RUNNING, UNHEALTHY
      RUNNING, CLOSED, UNHEALTHY
      CLOSED, CLOSED, UNHEALTHY
      

      ... or something like that

      Attachments

        Activity

          People

            dlaboss Dale LaBossiere
            dlaboss Dale LaBossiere
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: