
Thermos runner collect_updates() gets stuck in a while loop when timeout is 0

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.6.0
    • Component/s: Thermos
    • Labels: None
    • Sprint: Aurora Q3 Sprint 2

    Description

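      The following code in runner.py:collect_updates() can loop forever when the timeout is passed as 0. Because 0 is falsy in Python, the condition "timeout and total_time >= timeout" never evaluates to true, so the break is never reached: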

       while True:
         ...
         if timeout and total_time >= timeout:
           break
         ...
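      One possible way to avoid this, sketched under assumptions rather than taken from runner.py: compare the timeout against None explicitly, so that a timeout of 0 is treated as "return immediately" instead of "no timeout". The function name collect_updates_sketch and the parameters poll, sleep_interval and sleep are hypothetical stand-ins for illustration only.

```python
import time


def collect_updates_sketch(poll, timeout=None, sleep_interval=1.0, sleep=time.sleep):
    """Hypothetical stand-in for the polling loop; not the actual runner.py code.

    poll: callable returning a (possibly empty) list of updates.
    timeout: None means wait indefinitely; 0 means give up after one poll.
    """
    total_time = 0.0
    while True:
        updates = poll()
        if updates:
            return updates
        # Explicit None check: a plain truthiness test would skip this
        # branch for timeout=0 and the loop would never terminate.
        if timeout is not None and total_time >= timeout:
            return []
        total_time += sleep_interval
        sleep(sleep_interval)
```

      With this check, collect_updates_sketch(lambda: [], timeout=0) polls once and returns an empty list, while timeout=None keeps the wait-forever behaviour.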
      

      We observed a case where the Thermos runner got stuck in a "deadlocked" state, not reacting to SIGTERM, with the following as the last message in _main_.log:

      D0827 15:35:26.022495 30886 runner.py:856] Run loop: Work to be done within 0.0s
      

          People

            wickman Brian Wickman
            maximk Maxim Khutornenko