Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-8994

Fix race condition between move app and queue cleanup in Fair Scheduler

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 3.2.0
    • 3.3.0
    • fairscheduler
    • None
    • Reviewed

    Description

      Similar to YARN-8990 and also introduced by YARN-8191 there is a race condition while moving an application. The pre-move check looks for the queue and when it finds the queue it progresses. The real move then retrieves the queue and does further check before updating the app and queues.

      The move uses the retrieved queue object but the queue could have become empty while checks are performed. If the cleanup runs at that same time the app will be moved to a deleted queue and lost.

      Attachments

        1. YARN-8994.001.patch
          5 kB
          wilfreds#1

        Issue Links

          Activity

            People

              wilfreds Wilfred Spiegelenburg
              wilfreds Wilfred Spiegelenburg
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: