  1. Aurora
  2. AURORA-1353

MesosPathDetector double-counts checkpoint roots


Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.9.0
    • Component/s: Observer
    • Labels: None
    • Sprint: Twitter Aurora Q2'15 Sprint 5
    • Story Points: 2

    Description

      This manifests as the observer ballooning in resource usage. The TaskDetector finds two copies of every task because the 'latest' symlink duplicates the actual directory. The duplicates cause race conditions in on_active/on_finished inside src/main/python/apache/thermos/observer/task_observer.py, and as a result the observer holds onto TaskResourceMonitors, which tend to be pretty expensive to run.
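
      The double-counting described above can be illustrated with a minimal sketch. This is not the fix that was committed for this issue. The function name unique_checkpoint_roots and the idea of passing candidate roots as a plain list are assumptions made for illustration only. The sketch resolves each candidate path with os.path.realpath, so that a 'latest' symlink and the directory it points to collapse into a single entry.

```python
import os


def unique_checkpoint_roots(candidate_roots):
    """Collapse symlink aliases among candidate checkpoint roots.

    Hypothetical helper, not part of Aurora: it returns each distinct
    resolved path once, in the order it was first seen.
    """
    seen = set()
    unique = []
    for root in candidate_roots:
        # realpath follows symlinks, so an alias such as 'latest'
        # resolves to the same string as the directory it points to.
        resolved = os.path.realpath(root)
        if resolved not in seen:
            seen.add(resolved)
            unique.append(resolved)
    return unique
```

      With a detector that deduplicates in this way, each task would be reported once, so a single monitor per task would be started and released.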

          People

            Assignee: Brian Wickman (wickman)
            Reporter: Brian Wickman (wickman)
