Hadoop YARN / YARN-5074

RM cycles through container ids for an app that is waiting for resources.

Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 2.7.2
    • Fix Version/s: None
    • Component/s: resourcemanager
    • Labels: None

    Description

      /cc wangda, vinodkv

      This was observed on a cluster running a 2.7.x build. Here is the scenario:

      1. A YARN cluster is running applications that consume almost all of its resources, leaving very little available.
      2. A new app is submitted, and the resources required for its AM exceed what is available in the cluster. The app stays in the 'ACCEPTED' state until resources become available.
      3. Once resources are available and the AM container comes up, the AM container has an id indicating that the RM has been cycling through container ids. There are no errors of any kind in the logs. One example id for such an AM container is container_e3788_1462916288781_0012_01_000302. This indicates that while the app was in the 'ACCEPTED' state, the RM cycled through 301 container ids.
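
      The arithmetic behind the "301" figure can be checked by splitting the example id into its fields. The following is a minimal sketch, not Hadoop code: the class name ContainerIdSketch and its methods are hypothetical, and it assumes the usual layout container_e<epoch>_<clusterTimestamp>_<appId>_<attemptId>_<sequence> with per-attempt sequence numbers starting at 1.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper for illustration only; it does not use or mirror Hadoop's own classes.
final class ContainerIdSketch {
    // Assumed layout: container_[e<epoch>_]<clusterTimestamp>_<appId>_<attemptId>_<sequence>
    private static final Pattern ID = Pattern.compile(
            "container_(?:e(\\d+)_)?(\\d+)_(\\d+)_(\\d+)_(\\d+)");

    final long epoch;            // 0 when the id carries no "e<epoch>" part
    final long clusterTimestamp; // RM start time in milliseconds
    final int appId;
    final int attemptId;
    final long sequence;         // container number within the attempt

    private ContainerIdSketch(long epoch, long clusterTimestamp,
                              int appId, int attemptId, long sequence) {
        this.epoch = epoch;
        this.clusterTimestamp = clusterTimestamp;
        this.appId = appId;
        this.attemptId = attemptId;
        this.sequence = sequence;
    }

    static ContainerIdSketch parse(String text) {
        Matcher m = ID.matcher(text);
        if (!m.matches()) {
            throw new IllegalArgumentException("Not a container id: " + text);
        }
        long epoch = m.group(1) == null ? 0L : Long.parseLong(m.group(1));
        return new ContainerIdSketch(
                epoch,
                Long.parseLong(m.group(2)),
                Integer.parseInt(m.group(3)),
                Integer.parseInt(m.group(4)),
                Long.parseLong(m.group(5)));
    }

    // Ids consumed before this one, assuming numbering starts at 1 for each attempt.
    long idsBefore() {
        return sequence - 1;
    }

    public static void main(String[] args) {
        ContainerIdSketch id = parse("container_e3788_1462916288781_0012_01_000302");
        System.out.println("attempt " + id.attemptId + " of app " + id.appId
                + ": sequence " + id.sequence + ", ids used earlier: " + id.idsBefore());
    }
}
```

      For the id in this report the sequence field is 302, so idsBefore() yields 301, which matches the count quoted above. The sketch only decodes the id; it says nothing about why the RM consumed those ids.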

      Attachments

        1. YARN-5074-test-case.patch
          6 kB
          Wangda Tan

            People

              Assignee: Unassigned
              Reporter: Sidharta Seethana (sidharta-s)