Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-1336 [Umbrella] Work-preserving nodemanager restart
  3. YARN-2331

Distinguish shutdown during supervision vs. shutdown for rolling upgrade

    XMLWordPrintableJSON

Details

    Description

      When the NM is shutting down with restart support enabled there are scenarios we'd like to distinguish and behave accordingly:

      1. The NM is running under supervision. In that case containers should be preserved so the automatic restart can recover them.
      2. The NM is not running under supervision and a rolling upgrade is not being performed. In that case the shutdown should kill all containers since it is unlikely the NM will be restarted in a timely manner to recover them.
      3. The NM is not running under supervision and a rolling upgrade is being performed. In that case the shutdown should not kill all containers since a restart is imminent due to the rolling upgrade and the containers will be recovered.

      Attachments

        1. YARN-2331.patch
          12 kB
          Jason Darrell Lowe
        2. YARN-2331v2.patch
          14 kB
          Jason Darrell Lowe
        3. YARN-2331v3.patch
          13 kB
          Jason Darrell Lowe

        Activity

          People

            jlowe Jason Darrell Lowe
            jlowe Jason Darrell Lowe
            Votes:
            0 Vote for this issue
            Watchers:
            10 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: