REEF (Retired) / REEF-726

Race condition with completed Containers


Details

    Description

      There is a race condition in which a Container can be removed twice in YarnContainerManager. Specifically, removeAndGet() is called both in onContainersCompleted (without a lock) and by the EvaluatorManager (while holding a lock on the EvaluatorManager), so the two calls are not synchronized with each other.
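
One possible way to make a double removal harmless is sketched below. It is not the actual REEF code or the fix that was applied: ContainerRegistry, its String payload, and the add() helper are hypothetical names used only for illustration. The sketch assumes the container bookkeeping can be backed by a ConcurrentHashMap, whose remove() is atomic, so that only the first caller of removeAndGet() for a given ID gets the container, regardless of which lock (if any) the caller holds.

```java
import java.util.Map;
import java.util.Optional;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical stand-in for the container bookkeeping; not REEF's actual class.
final class ContainerRegistry {
  // Container ID -> container payload (a String here, for illustration only).
  private final Map<String, String> containers = new ConcurrentHashMap<>();

  void add(final String containerId, final String container) {
    containers.put(containerId, container);
  }

  /**
   * Atomically removes the container with the given ID.
   * Only the first caller for that ID receives the value;
   * any later or concurrent caller receives Optional.empty().
   */
  Optional<String> removeAndGet(final String containerId) {
    return Optional.ofNullable(containers.remove(containerId));
  }
}
```

With this shape, both the completion callback and the EvaluatorManager path could call removeAndGet() and simply skip their cleanup when the result is empty, instead of relying on a lock that only one of the two callers holds.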

      Attachments

        1. restart_failure.txt
          1.34 MB
          Andrew Chung
        2. completed_containers_race.txt
          5 kB
          Andrew Chung

            People

              afchung90 Andrew Chung
              afchung90 Andrew Chung
