Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-5074

RM cycles through container ids for an app that is waiting for resources.

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 2.7.2
    • Fix Version/s: None
    • Component/s: resourcemanager
    • Labels:
      None

      Description

      /cc Wangda Tan, Vinod Kumar Vavilapalli

      This was observed on a cluster running a 2.7.x build. Here is the scenario :

      1. A YARN cluster has applications running that almost entirely consume the cluster, with little available resources.
      2. A new app is submitted - the resources required for the AM exceed what is available in the cluster. The app stays in the 'ACCEPTED' state till resources are available.
      3. Once resources are available and the AM container comes up, the AM container has a id that indicates that the RM has been cycling through containers. There are no errors in the logs of any kind. One example id for such an AM container is : container_e3788_1462916288781_0012_01_000302 . This indicates that while the app was in the 'ACCEPTED' state, the RM cycled through 301 containers.

        Attachments

        1. YARN-5074-test-case.patch
          6 kB
          Wangda Tan

          Issue Links

            Activity

              People

              • Assignee:
                Unassigned
                Reporter:
                sidharta-s Sidharta Seethana
              • Votes:
                0 Vote for this issue
                Watchers:
                9 Start watching this issue

                Dates

                • Created:
                  Updated: