  1. Hadoop YARN
  2. YARN-7542

Fix issue that causes some Running Opportunistic Containers to be recovered as PAUSED

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 3.1.0, 2.9.1, 3.0.1
    • Component/s: None
    • Labels: None

      Description

      Steps to reproduce:

      • Start a YARN cluster with opportunistic containers enabled and the NM queue length set to a value greater than 10. Also enable work-preserving restart.
      • Start an MR job (without opportunistic containers).
      • Kill the NM and restart it.
      • The NM logs show some of the containers in SUSPENDED state, even though they are still running.
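
      The setup in the first step could look roughly like the yarn-site.xml fragment below. This is a sketch, not the reporter's actual configuration: the property names are taken from the Hadoop documentation for opportunistic containers and NodeManager restart and should be checked against the release in use, while the queue length of 20, the recovery directory, and the port are arbitrary illustrative values.

      ```xml
      <!-- Illustrative yarn-site.xml fragment; all values are assumptions -->
      <property>
        <!-- Enable opportunistic container allocation on the RM -->
        <name>yarn.resourcemanager.opportunistic-container-allocation.enabled</name>
        <value>true</value>
      </property>
      <property>
        <!-- NM queue length; any value greater than 10 matches the report -->
        <name>yarn.nodemanager.opportunistic-containers-max-queue-length</name>
        <value>20</value>
      </property>
      <property>
        <!-- Work-preserving NM restart -->
        <name>yarn.nodemanager.recovery.enabled</name>
        <value>true</value>
      </property>
      <property>
        <!-- Hypothetical local path where the NM stores its recovery state -->
        <name>yarn.nodemanager.recovery.dir</name>
        <value>/var/hadoop/yarn-nm-recovery</value>
      </property>
      <property>
        <!-- A fixed port, so the restarted NM comes back on the same address -->
        <name>yarn.nodemanager.address</name>
        <value>0.0.0.0:45454</value>
      </property>
      ```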

      Sampada Dehankar / kartheek muthyala, can you take a look at this?

        Attachments

        1. YARN-7542.001.patch
          1 kB
          Sampada Dehankar

            People

            • Assignee: Sampada Dehankar (sampada15)
            • Reporter: Arun Suresh (asuresh)
            • Votes: 0
            • Watchers: 5
