  1. Hadoop YARN
  2. YARN-7003

DRAINING state of queues is not recovered after RM restart

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.9.0, 3.0.0-alpha4
    • Fix Version/s: 3.2.0, 3.1.1, 2.9.2, 3.0.3
    • Component/s: capacityscheduler
    • Labels:
      None
    • Hadoop Flags:
      Reviewed

      Description

      DRAINING is a transient queue state that exists only in RM memory. When a queue's state is set to STOPPED while it still has pending or active apps, refreshing queues changes its state to DRAINING instead of STOPPED. We have encountered the problem that, after an RM restart, the state of such a queue is always STOPPED, so the queue can be removed at any time, leaving some apps in a non-existent queue.
      To fix this problem, we could recover the DRAINING state during the recovery of pending/active apps. I will upload a patch with a test case later for review.
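
      The proposed fix can be illustrated with a minimal sketch. This is not the attached patch and does not use the real CapacityScheduler classes: `QueueState`, `SketchQueue`, and all method names below are hypothetical, and the DRAINING-to-STOPPED transition once the last app finishes is an assumption of this model. It only shows the idea that recovering an app into a STOPPED queue should move the queue back to DRAINING, so that it is not treated as removable while apps remain.

```java
// Hypothetical, simplified model for illustration only; these are NOT the
// actual Hadoop YARN CapacityScheduler classes or method names.
enum QueueState { RUNNING, DRAINING, STOPPED }

class SketchQueue {
    private final String name;
    private QueueState state;
    private int liveApps = 0; // pending + active apps tracked in memory

    // After a restart the queue starts from its configured state,
    // with no in-memory knowledge of previously submitted apps.
    SketchQueue(String name, QueueState configuredState) {
        this.name = name;
        this.state = configuredState;
    }

    String getName() { return name; }
    QueueState getState() { return state; }

    // Called once for every pending/active app replayed during recovery.
    // The key step: a STOPPED queue that still owns apps becomes DRAINING.
    void recoverApp() {
        liveApps++;
        if (state == QueueState.STOPPED) {
            state = QueueState.DRAINING;
        }
    }

    // Assumption of this sketch: a DRAINING queue becomes STOPPED
    // once its last app has finished.
    void appFinished() {
        if (liveApps > 0) {
            liveApps--;
        }
        if (state == QueueState.DRAINING && liveApps == 0) {
            state = QueueState.STOPPED;
        }
    }

    // Only a fully stopped queue without apps is safe to remove.
    boolean canBeRemoved() {
        return state == QueueState.STOPPED && liveApps == 0;
    }
}
```

      In this model, a queue configured as STOPPED that recovers one app reports DRAINING and `canBeRemoved()` returns false until `appFinished()` has been called for that app.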

        Attachments

        1. YARN-7003.001.patch
          7 kB
          Tao Yang
        2. YARN-7003.002.patch
          7 kB
          Tao Yang
        3. YARN-7003.003.patch
          7 kB
          Tao Yang
        4. YARN-7003.004.patch
          7 kB
          Tao Yang

            People

            • Assignee:
              Tao Yang
              Reporter:
              Tao Yang
            • Votes:
              0
              Watchers:
              6
