Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-7542

Fix issue that causes some Running Opportunistic Containers to be recovered as PAUSED

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 3.1.0, 2.10.0, 2.9.1, 3.0.1
    • None
    • None

    Description

      Steps to reproduce:

      • Start YARN cluster - Enable Opportunistic containers and set NM queue length to something > 10. Also Enable work preserving restart
      • Start an MR job (without opportunistic containers)
      • Kill the NM and restart it again.
      • In the logs - it shows that some of the containers are in SUSPENDED state - even though they are still running.

      sampada15 / kartheek, can you take a look at this ?

      Attachments

        1. YARN-7542.001.patch
          1 kB
          Sampada Dehankar

        Activity

          People

            sampada15 Sampada Dehankar
            asuresh Arun Suresh
            Votes:
            0 Vote for this issue
            Watchers:
            6 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: