Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-7003

DRAINING state of queues is not recovered after RM restart

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 2.9.0, 3.0.0-alpha4
    • 3.2.0, 3.1.1, 2.9.2, 3.0.3
    • capacityscheduler
    • None
    • Reviewed

    Description

      DRAINING state is a temporary state in RM memory, when queue state is set to be STOPPED but there are still some pending or active apps in it, the queue state will be changed to DRAINING instead of STOPPED after refreshing queues. We've encountered the problem that the state of this queue will aways be STOPPED after RM restarted, so that it can be removed at any time and leave some apps in a non-existing queue.
      To fix this problem, we could recover DRAINING state in the recovery process of pending/active apps. I will upload a patch with test case later for review.

      Attachments

        1. YARN-7003.004.patch
          7 kB
          Tao Yang
        2. YARN-7003.003.patch
          7 kB
          Tao Yang
        3. YARN-7003.002.patch
          7 kB
          Tao Yang
        4. YARN-7003.001.patch
          7 kB
          Tao Yang

        Activity

          People

            Tao Yang Tao Yang
            Tao Yang Tao Yang
            Votes:
            0 Vote for this issue
            Watchers:
            6 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: